(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 

International Bureau 

(43) International Publication Date 
27 September 2001 (27.09.2001) 




PCT 



(10) International Publication Number 

WO 01/70808 A2 



(51) International Patent Classification 7 : C07K 14/47 
C12N 15/12, 15/11, A61P 35/00, 9/10, C12N 5/10 CVQ 
1/68, A01K 67/027 

(21) International Application Number: PCT/US0 1/09609 

(22) International Filing Date: 22 March 2001 (22.03:2601) 

(25) Filing Language: Eng]ish 

(26) Publication Language: English 

(30) Priority Data: 

60/191,134 22 March 2000 (22.03.2000) US 

(71) Applicants (for all designated States except US): CURA- 
GEN CORPORATION [US/US]; 11th Floor, 555 Long 
Wharf Drive, New Haven, CT 06511 (US). GENEN- 

; TECH, INC. [US/US]; 1 DNA Way, South San Francisco 
CA 94080 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): RASTELLI, Luca, 
K. [US/US]; 52 Pepperbush Lane, Guilford, CT 06437 
(US). GERRITSEN, Mary [US/US]; -. 



(74) Agent: RAUCH, Paul, E.; Brinks Hofer Gilson & Done, 
P.O. Box 10087, Chicago, IL 606 10 (US). 

(81)..Designated States (national): AE, AG, AL, AM, AT AU 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH f CN, CO, CR CU 
CZ, DE, DK, DM, DZ, EE, ES, FT, GB t GD, GE, GH,W 
HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, LK 
LR, LS, LT, LU, LV, MA, MD, MG, MX, MN, MW Mx' 
MZ, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK SL 
TJ, TM, TR, TT, TZ, UA, UG, US, UZ, VN, YU, ZA, W 

(84) Designated States (regional): ARIPO patent (GH GM 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European 
patent (AT, BE, CH, CY, DE, DK, ES, H, FR, GB, GR IE 
IT, LU, MC, NL, PT, SE, TR), OAPI patent (BF, BJ CR 
CG, CI, CM, GA, GN, GW, ML, MR, NE, SN, TD, TG). ' 

Published: 

— without international search report and to be republished 
upon receipt of that report 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



< 

00 
00 



» (54) Title: ANGIOGENICS ASSOCIATED PROTEINS, AND NUCLEIC ACIDS ENCODING THE SAME 

£ ir^r? ^W™* ^"B at ,easl *> % sequence identity to the sequence SEQ ID NOS: 2 4 6 8 10 12, 14 

^ or 16, and polynucleotides encoding the same, are useful for modulating angiogenesis * 



WO 01/70808 



PCTAJSOl/09609 



ANGIOGENISIS ASSOCIATES PROTEINS, AND NUCLEIC ACIDS 
ENCODING THE SAME 

Related Applications 

This application claims priority to U.S. provisional application Serial No 
60/191,134 filed 03/22/2000. 

Background 

Cities have roads and alleys, plants have xylem and phloem, and people have 
arteries, veins and lymphatics. Without these byways, the vertebrate animal cells would 
starveor drown in their metabolic refuse. Not only do blood vessels deliver food and 
oxygen and carry away metabolic wastes, but they also transport signaling substances that 
appnse cells of situations remote to them but to which they need to respond. Hormonal 
messages are a common signal. 

All blood vessels are ensheathed by a basal lamina and a delicate monolayer of 
remarkably plastic endothelial cells lining the luminal walls. Depending on location and 
function, smooth muscle and connective tissue may also be present. 

Not only do healthy cells depend on the blood resources transported by the 
curatory system, but so, too, unwanted cells: tumorigenic and malignant cells These 
cells colonize and proliferate if they are able to divert blood resources to themselves 
Anagenesis, the type of blood vessel formation where new vessels emerge from the 
prohferation of preexisting vessels (Risau, 1995; Risau and Flamme, 1995), is exploited 
not onlybyusual processes, such as in wound healing or myocardial infarction repair but 
also by tumors themselves and in cancer,, diabetic retinopathy, macular degeneration, 
psonasis, and rheumatoid arthritis. Regardless of the process, whether pathological or 
usual physiological, endothelial cells mediate anagenesis in a multi-step fashion- (1) 
endothelia receive an extracellular cue, (2) the signaled cells breach the basal lamina 
sheath, abetted by proteases they secrete, (3) the cells then migrate to the signal and 
proliferate, and finally, (4) the cells form a tube, a monogenic event (Alberts * al 
1994). The complexity of this process indicates complex changes in cellular physiology 
and morphology, gene expression, and signaling. Angiogenic accomplices that are cues 
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include basic fibroblast growth factors (bFGF), angiopoietins (such as ANGl) and 
various forms of vascular endothelial growth factor (VEGF). 

The molecular events and the order in which they occur and the pathways that ar, 
required for this process are of fundamental importance to understand angiogenesis In 
vitro models are useful for identifying alterations in gene expression that occur during 
angiogenesis. A particularly fruitful model systems involves the suspension in a three- 
dimensional type I collagen gel and various stimuli, such as phorbol myristate acetate 
(PMA), basic fibroblast growth factor (bFGF), and VEGF. The combination of the 
stimuli and the collagen gel results in the formation of a three-dimensional tubular 
network of endothelial cells with interconnecting lumenal structures. In this model 
endothelial differentiation into tubelike structures is completely blocked by inhibitors of 
new mRNA or protein synthesis. Furthermore, the cells progress through differentiation 
in a coordinated and synchronized manner, thus optimizing the profile of gene expression 
(Kahn et al., 2000; Yang et al, 1999). 

Tumor cells exploit angiogenesis to facilitate tumor growth. Controlling 
angiogenesis, by controlling the activity or expression of genes and proteins associated 
with angiogenesis, provides a way to prevent tumor growth, or even destoy tumors'. 



Summary 



The invention is based in part upon the discovery of novel nucleic acid sequences 
encoding novel polypeptides. Nucleic acids encoding the polypeptides disclosed in the 
invention, and derivatives and fragments thereof, will hereinafter be collectively 
designated as "AAP" nucleic acid or polypeptide sequences. AAP, or angiogenesis 
associated polypeptides (AAP) comprises kelch-like polypeptide (KLP), human ortholog 
of mouse BAZF (hBAZF), hmt-elongation factor G (hEF-G), human ortholog of rat TRG 
(hTRG), human myosin X (hMXl) and its splice variant (hMX2), nuclear hormone 
receptor (NHR), and human mitochondrial protein (hMP). 

The invention is based in part upon the discovery of novel nucleic acid sequences 
encoding novel polypeptides. Nucleic acids encoding the polypeptides disclosed in the 
invention, and derivatives and fragments thereof, will hereinafter be collectively 
designated as "AAP" nucleic acid or polypeptide sequences." 



In a first aspect, the present invention is an isolated polypeptide having at least 
80% sequence identity to the sequence SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16, 
polynucleotides encoding the same, and antibodies that specifically bind the same. 

- In a second aspect, the present invention is an isolated polynucleotide having at 
least 80% sequence identity to the sequence SEQ ID NOS: 1, 3, 5, 7, 9, 1 1, 13 or 15, or a 
complement thereof. 

In a third aspect, the present invention is a transgenic non-human animal, having j 
disrupted AAP gene or a transgenic non-human animal expressing an exogenous 
polynucleotide having at least 80% sequence identity to the sequence SEQ ID NOS:l, 3, 
5, 7, 9, 1 1, 13 or 15, or a complement of said polynucleotide. 

In a fourth aspect, the present invention is a method of screening a sample for a 
mutation in an AAP gene. 

In a fifth aspect, the present invention is a method of modulating angiogenesis 
comprising modulating the activity of at least one AAP polypeptide. 

In a sixth aspect, the present invention is a method of increasing, as well as 
decreasing angiogenesis, comprising modulating the activity of at least one AAP 
polypeptide. Activity modulation of AAP polypeptides may be over-expressing or 
eliminating expression of the gene, or impairing a AAP polypeptide's function by contact 
with specific antagonists or agonists, such as antibodies or aptamers. 

In a seventh aspect, the present invention is a method of treating various 
pathologies, including tumors, cancers, myocardial infarctions and the like. 

In an eighth aspect, the present invention is a method of measuring a AAP 
transcriptional and translational up-regulation or down-regulation activity of a compound. 

In a ninth aspect, the invention is a method of screening a tissue sample for 
tumorigenic potential. 

In a tenth aspect, the invention is a method of determining the clinical stage of 
tumor which compares the expression of at least one AAP in a sample with expression of 
said at least one gene in control samples. 

Although methods and materials similar or equivalent to those described herein 
can be used in the practice or testing of the present invention, suitable methods and 
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materials are described below. Li addition, the materials, methods, and examples are 
illustrative only and not intended to be limiting. 

Detailed description 

A model of angiogenesis-the suspension of endothelial cells in type I collagen gels 
with various stimuli-was used to identify a molecular fingerprint or transcriptional profile 
of endothelial differentiation into nmelike structures, using amplification and an imaging 
approach called GeneCalling (Shimkets et al, 1999). This method was previously shown 
to provide a comprehensive sampling of cDNA populations in conjunction with the 
sensitive detection of quantitative differences in mRNA abundance for both known and 
novel genes. Many differentially expressed cDNA fiagments were found. The 
identification and differential expression of these gens was confirmed by a second 
independent method employing real-time quantitative polymerase chain reaction (PCR). 
Although some of the identified cDNA fragments were genes known to play some role in 
angiogenesis, many other differentially expressed genes were unexpected. The inventors 
have identified the unexpected genes and polypeptides that are expressed in response to 
this model of angiogenesis, collectively refered to as angiogenesis associated 
polypeptides (AAP). AAP are kelch-like polypeptide (KLP), human ortholog of mouse 
BAZF (hBAZF), hmt-elongation factor G (hEF-G), human ortholog of rat TRG (hTRG), 
human myosin X (hMXl) and its splice variant (hMX2), nuclear hormone receptor 
(NHR), and human mitochondrial protein (hMP). ' 

Definitions 

Unless defined otherwise, all technical and scientific terms have the same 
meaning as is commonly understood by one of skill in the art to which this invention 
belongs. The definitions below are presented for clarity. 

The recommendations of (Demerec et al, 1966) where these are relevant to 
genetics are adapted herein. To distinguish between genes (and related nucleic acids) and 
the proteins that they encode, the abbreviations for genes are indicated by italicized (or 
underlined ) text while abbreviations for the proteins start with a capital letter and are not 
italicized. Thus, AAP or AAP refers to the nucleotide sequence that encodes AAP. 
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"Isolated," when referred to a molecule, refers to a molecule that has been 
identified and separated and/or recovered from a component of its natural environment 
Contaminant components of its natural environment are materials that interfere with 
diagnostic or therapeutic use. 

"Container" is used broadly to mean any receptacle for holding material or 
reagent Containers may be fabricated of glass, plastic, ceramic, metal, or any other 
material that can hold reagents. Acceptable materials will not react adversely with the 
contents. 



1. Nucleic acid-related definitions 

(a) control sequences 

Control sequences are DNA sequences that enable the expression of an operably- 
linked coding sequence in a particular host organism Prokaryotic control sequences 
include promoters, operator sequences, and ribosome binding sites. Eukaryotic cells 
utilize promoters, polyadenylation signals, and enhancers. 

(b) operably-linked 

Nucleic acid is operably-linked when it is placed into a functional relationship 
with another nucleic acid sequence. For example, a promoter or enhancer is operably- 
linked to a coding sequence if it affects the transcription of the sequence, or a ribosome- 
binding site is operably-linked to a coding sequence if positioned to facilitate translation. 
Generally, "operably-linked" means that the DNA sequences being linked are contiguous, 
and, in the case of a secretory leader, contiguous and in reading phase. However, 
enhancers do not have to be contiguous. Linking is accomplished by conventional 
recombinant DNA methods. 

(c) isolated nucleic acids 

An isolated nucleic acid molecule is purified from the setting in which it is found 
in nature and is separated from at least one contaminant nucleic acid molecule. Isolated 
AAP molecules are distinguished from the specific AAP molecule, as it exists in cells. 
However, an isolated AAP molecule includes AAP molecules contained in cells that 
ordinarily express an AAP where, for example, the nucleic acid molecule is in a 
chromosomal location different from that of natural cells. 
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2. Protein-related definitions 
(a) purified polypeptide 

When the molecule is a purified polypeptide, the polypeptide will be purified (1) 
to obtain at least 15 residues of N-terminal or internal amino acid sequence using a 
sequenator, or (2) to homogeneity by SDS-PAGE under non-reducing or reducing 
conditions using Coomassie blue or silver stair, Isolated polypeptides include those 
expressed heterologously in geneticaUy-engineered cells or expressed in vitro, since at 
least one component of an AAP natural environment will not be present Ordinarily, 
isolated polypeptides are prepared by at least one purification step. 

(b) active polypeptide 

An active AAP or AAP fragment retains a biological and/or an immunological 
activity of the native or naturally-occurring AAP. Immunological activity refers to the 
ability to induce the production of an antibody against an antigenic epitope possessed by a 
nahve AAP; biological activity refers to a function, either inhibitory or stimulatory 
caused by a native AAP that excludes immunological activity. A biological activity of 
AAP includes, for example, modulating angiogenesis. 

(c) Abs 

Antibody may be single anti-AAP monoclonal Abs (including agonist, antagonist 
and neutralizing Abs), anti-AAP antibody compositions with polyepitopic specificity 
single chain anti-AAP Abs, and fragments of anti-AAP Abs. A "monoclonal antibody- 
refers to an antibody obtained from a population of substantially homogeneous Abs, i e 
the individual Abs comprising the population are identical except for naturally-occurring 
mutations that may be present in minor amounts 

(d) epitope tags 

An epitope tagged polypeptide refers to a chimeric polypeptide fused to a 'lag 
polypeptide". Such tags provide epitopes against which Abs can be made or are 
available, but do not interfere wjth polypeptide activity. To reduce anti-tag antibody 
reactivity with endogenous epitopes, the tag polypeptide is preferably unique. Suitable tag 
polypeptides generally have at least six amino acid residues and usually between about 8 
and 50 amino acid residues, preferably between 8 and 20 amino acid residues). Examples 
of epitope tag sequences include HA from Influenza A virus and FLAG. 
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The novel AAP of the invention Mode the nucleic acid, whose sequences am 
provided inTables .,3,5,7, 9, „, 13and ,5, or a fragment thereof The invention also 
md^esamntotorvarian^.^of^^^^^^^^ 

corresponding base shown in TaMes f, 3, 5, 7, 9, ,1, !3 and 15 whUeaiB encodinga 

iagmen, of such a nucleic acid. The invention further includes nucleic acids whose 
sequences are complement to mose jus, described, including conmtaentery nudeic 
actd fragments. The invention addi«o nallyincllldKnucleica( . dsoriiiicle . cadd 

fragments, or complements thereto, whose shuctures include chemical modifications. 

learned „„, a, ieas. in part to enhance the chemical stability of the modified nucieic 
aad, such tha, they may be used, for example, as anti-sense binding nudeic acids in . 
therapeutic applications in a subject In the mutan, or variant nudeic adds, and their 
complements, up to 20% or more of the bases may be so changed 

The novel AAP of me invention include the protein fragments whose sequences 
*reprov,dedinTab,es2,4,«, 8 , ,„, I2> Umi 16 . TheinvenfionaJsoindudesanAAP 
muster variant protein, ar,y of whose residues maybe changed from the corresponding 

mamtams its native activities and physiological functions, or a functional fragment 
•hereof I„ me m«an, or variant AAP, up to 20% or more of me residues may be so 

TT* ^ Vma0 ° fMera,C ° mPaSSKAbSandan,ib0dyfc '^-">-^or 
(!■*) a, that bmd immunospecificaUy to any of the AAP of the invention 

The AAP nucleic ados are shown in Tables 1, 3, 5, 7, 9, 1 1,13 ar«i 15 , and (he 

Star, and stop codons in mepofynudeotide sequences are indicated in boldface and with 
m^sm SEQfDNOJIacksastopcodon. The sequences of HWC, and Mm do 
nothavestartcodonsCseeTabfe fT); consequent,* hMX, and hMX2 peptides do no, 
start wtm aMet For any lacking nucleotide sequence, one of slriU in the art may 
~e„u I ,e„gmsequenc«„y,for ^ probing cDNA or genomic libraries 
witt, probes designed according to the sequences of the instant invention 
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Table 1 



ctggcctaga 
acataggatg 
caattgcaga 
taacacagga 
aatgtgtcag 
tgaagtattc 
attaatggaa 
tgtcattgtt 
taattgttga 
atttcagggc 
ggaatgtcga 
caatgaatga 
atgtgttaca 
gattgttgag 
tggtggagca 
acctactgat 
gagaagctgc 
ctgttcttag 
aaggtcitgcc 
agtttttcaa 
cttcagaaaa 
tttacaagtt 
ataatgatat 
gtaaaacaag 
cacagcaaaa 
tggtttgctg 
atcggaggac 
taccttgtgc 
cactgaacct 
gacagactag 
gagggttgca 
cttcagtaac 
tccctgctaa 
gtgtgtttat 
acctggaact 
tggggagaga 
catggaaacc 



tactacaact 
ttgccacaaa 
tttgagcaca 
tgacctagga 
tcttggtact 
acttgacctt 
tacagtc abg 
ggaacagttg 
gggcactgaa 
catgtttatg 
tgctgccacc 
cagcactgta 
acgttgtcga 
ttttgctgat 
caagttcact 
agatattctc 
tatgctgtgg 
ccaaatcaga 
acccaatgat 
accaagactt 
tccttgtagt 
atgtagccca 
ctacatagca 
caaacttcag 
tacctggttt 
tgaaggctat 
cgtagaaaga 
ttggcaatgg 
catgtactgt 
taggtccttt 
tattgctacc 
tgtggaaatt 
gaggtactct 
gcgagaaacc 
tgaccggtgg 
ttttcgatgc 
accaacttat 



KLP nucleotide sequence (SEQ ID NO:l) 



gaacEEtctc tctttttagt tactccacag gatccgctga io" 

atctacctcg tgtatttttc tctttcactc atgagctgca 120 

atgtctgcag actgtgttga aaaactctga agaacctaat i 80 

gtgattctaa gtctgtgtaa caagatatta ctcattagtg 240 

gaatgctgca gataacagca agtaggttct cctttatttc 3 00 

ccatcagtaa gacggacttt tctaatctgt tcctggagat 360 

tccactcaag acgagaggca gatcaatact gaatatgctg 420 

aaactgtttt atgaacagca gttgtttact gacatagtgt 480 

ttcccttgtc ataagatggt tcttgcaaca tgtagctctt 540 

agtggactaa gtgaaagcaa acaaacccat gtacacctga 600 

ttacagataa taataactta tgcatacacg ggtaacttgg 660 

gaacagcttt atgaaacagc ttgcttccta caggtagaag 720 

gaatatttaa ttaaaaaaat aaatgcagag aattgtgtac 78 o 

ctcttcagtt gtgaggaatt aaaacagagt gctaaaagaa 840 

gctgtgtatc atcaggacgc gttcatgcag ctgtcacatg 900 

agtagtgaca atttaaatgt agaaaaggaa gaaaccgttc 960 

ctagagtata acacagaatc acgatcccag tatttgtctt 1020 

attgatgcac tfctcagaagt aacacagaga gcttggtttc 108O 

aagtcagtgg tggttcaagg tctgtataag tccatgccca n 40 

gggatgacta aagaggaaat gatgattttc attgaagcat 12 00 

ctttactctt ctgtctgtta cagcccccaa gcagaaaaag i 260 

ccagctgatt tgcataaggt tgggaccgtt gtaactcctg 1320 

gggggtcaag ttcctctgaa aaacacaaaa acaaatcaca 1380 

actgccttca gaactgtgaa ttgcttttat tggtttgatg i 440 

ccaaagaccc caatgctttt tgtccgcata aagccatctt 1500 

atctatgcaa ttggaggaga tagcgtaggt ggagaactta 1560 

tacgacactg agaaagatga gtggacgatg gtaagccctt 1620 

agtgcagcag ttgtggttca tgactgcatt tatgtgatga i 680 

tattttccaa ggtctgactc atgggtagaa atggccatga i 740 

gcttcagctg cagcttttgg tgataaaatt ttctatattg i 800 

aattccggca taagactccc ctctggcact gtagatgggt i 860 

tatgatgtga ataaaaatga gtggaaaatg gcagccaaca 1920 

gacccctgtg ttagagctgt tgtgatctca aattctctat i 980 

cacttaaatg agcgagctaa atacgtcacc taccaatatg 2040 

tctctgcggc agcatatatc tgaacgtgta ctgtgggact 2100 

actgtgggga aactctatcc atcctgcctt gaagagtctc 2160 

cttttttcaa cggatgggac agaagagttt gaactggatg 2220 
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gagaaatggt tgcactacca cctgtatagt ggggaagttc agggagtgca cgcctgigtt 228T 

atgtgctttg tcattttctt tgctaaacaa aagaggctat gaaagaacta aatatgagta 2340 
cataaaattc tatctttgat aaattttatt tttatgccct acttaatatt tgcatcagta 2400 
taatatatat cagtgagtct tacagaaaga tatgcttcca taatatgaaa tagattattc 2460 
aataattgag aaactttatg tgtaatcatg agagtataag aatctggatt atctaacatt 2520 
gttagccctg tgtatgtaca gttcaaaaag ttcatttata aaagtagttt cctgttc 2577 



Table 2 KLP polypeptide sequence (SEQ ID NO:2) 



Met Ser Thr Gin Asp Glu Arg Gin lie Asn Thr Glu Tyr Ala Val Ser 
5 10 15 

Leu Leu Glu Gin Leu Lys Leu Phe Tyr Glu Gin Gin Leu Phe Thr Asp 
20 25 30 

He Val Leu He Val Glu Gly Thr Glu Phe Pro Cys His Lys Met Val 
S 40 45 

Leu Ala Thr Cys Ser Ser Tyr Phe Arg Ala Met Phe Met Ser Gly Leu 

55 60 

Ser Glu Ser Lys Gin Thr His Val His Leu Arg Asn Val Asp Ala Ala 

Thr Leu Gin He lie lie Thr Tyr Ala Tyr Thr Gly Asn Leu Ala Met 
85 9° 95 

Asn Asp ser Thr Val Glu Gin Leu Tyr Glu Thr Ala Cys Phe Leu Gin 
100 110 

Val Glu Asp Val Leu Gin Arg Cys Arg Glu Tyr Leu He Lys Lys He 
H5 120 125 

Asn Ala Glu Asn Cys Val Arg Leu Leu Ser Phe Ala Asp Leu Phe Ser 

135 140 

Cys Glu Glu Leu Lys Gin Ser Ala Lys Arg Met Val Glu His Lys Phe 
150 155 iso 

Thr Ala val Tyr His Gin Asp Ala Phe Met Gin Leu Ser His Asp Leu 
165 170 i 7 5 

Leu He Asp lie Leu Ser Ser Asp Asn Leu Asn Val Glu Lys Glu Glu 

180 185 190 

Thr Val Arg Glu Ala Ala Met Leu Trp Leu Glu Tyr Asn Thr Glu 



195 200 



Ser 



205 



Arg Ser Gin Tyr Leu Ser Ser Val Leu Ser Gin He Arg He Asp Ala 

215 220 
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Leu Ser Glu Val Thr Gin Arg Ala Trp Phe Gin Gly Leu Pro Pro Asn 
225 230 235 

Asp Lys Ser Val Val Val Gin Gly Leu Tyr Lys Ser Met Pro Lys Phe 
24 5 250 



255 



Phe Lys Pro Arg Leu Gly Met Thr Lys Glu Glu Met Met lie Phe lie 
260 265 270 

Glu Ala Ser Ser Glu Asn Pro Cys Ser Leu Tyr Ser Ser Val Cvs Tvr 
275 280 285 

Ser Pro Gin Ala Glu Lys Val Tyr Lys Leu Cys Ser Pro Pro Ala Asp 
290 295 300 P 

Leu His Lys Val Gly Thr Val Val Thr Pro Asp Asn Asp He Tyr He 



310 



315 320 
Ala Gly Gly Gin Val Pro Leu Lys Asn Thr Lys Thr Asn His Ser Lys 



325 



330 



335 



Thr Ser Lys Leu Gin Thr Ala Phe Arg Thr Val Asn Cys Phe Tyr Trp 
340 345 



350 

Phe Asp Ala Gin Gin Asn Thr Trp Phe Pro Lys Thr Pro Met Leu Phe 
355 3 6 o 365 

Val Arg He Lys Pro Ser Leu Val Cys Cys Glu Gly Tyr He Tyr Ala 
370 375 380 

lie Gly Gly Asp Ser Val Gly Gly Glu Leu Asn Arg Arg Thr Val Glu 
385 390 395 400 

Arg Tyr Asp Thr Glu Lys Asp Glu Trp Thr Met Val Ser Pro Leu Pro 
405 4io 415 

Cys Ala Trp Gin Trp Ser Ala Ala Val Val Val His Asp Cys He Tyr 



425 



430 



Val Met Thr Leu Asn Leu Met Tyr Cys Tyr Phe Pro Arg Ser Asp Ser 



440 



445 



Trp Val Glu Met Ala Met Arg Gin Thr Ser Arg Ser Phe Ala Ser Ala 
450 455 4so 

Ala Ala Phe Gly Asp Lys He Phe Tyr He Gly Gly Leu His He Ala 
465 470 475 48 o 

Thr Asn Ser Gly He Arg Leu Pro Ser Gly Thr Val Asp Gly Ser Ser 



485 



490 



495 



Val Thr Val Glu He Tyr Asp Val Asn Lys Asn Glu Trp Lys Met Ala 



500 



505 



510 



Ala Asn He Pro Ala Lys Arg Tyr Ser Asp Pro Cys Val Arg Ala Val 



515 



520 



525 



Val He Ser Asn Ser Leu Cys Val Phe Met Arg Glu Thr His Leu Asn 
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530 531 ^ 

Glu Arg Ala Lys Tyr Val Thr Tyr Gin Tyr Asp Leu Glu Leu Asp Arg 

550 555 

Trp Ser Leu Arg Gin His lie Ser Glu Arg Val Leu Trp Asp Leu Gly 
565 570 5?5 

Arg Asp Phe Arg Cys Thr Val Gly Lys Leu Tyr Pro Ser Cys Leu Glu 



580 



585 



590 



Glu Ser Pro Trp Lys Pro Pro Thr Tyr Leu Phe Ser Thr Asp Gly Thr 
595 eoo 605 

Glu Glu Phe Glu Leu Asp Gly Glu Met Val Ala Leu Pro Pro Val 
610 615 620 



Table 3 hBAZF nucleotide sequence (SEQ ID NO:3) 
caagggagcg agggtgtcgt agagggcaga atgaacaaga agaattagga gggaggctgc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



gtgtgccggg gctaggggct ggaagtcctg gctctagttg cacctcggaa ggaaaaggca 
aacagaggag ggaaggcgtc ttaggactgc ctggatccag agcactttcc tcggcctcta 
caggcctgtg tcgctatjjgg ttcccccgcc gccccggagg gagcgctggg ctacgtccgc 
gagttcactc gccactcctc cgacgtgctg ggcaacctca acgagctgcg cctgcgcggg 
atcctcactg acgtcacgct gctggttggc gggcaacccc tcagagcaca caaggcagtt 
ctcatcgcct gcagtggctt cttctattca attttccggg gccgtgcggg agtcggggtg 
gacgtgctct ctctgcccgg gggtcccgaa gcgagaggct tcgcccctct attggacttc 
atgtacactt cgcgcctgcg cctctctcca gccactgcac cagcagtcct agcggccgcc 
acctatttgc agatggagca cgtggtccag gcatgccacc gcttcatcca ggccagctat 
gaacctctgg gcatctccct gcgcoccctg gaagcagaac ccccaacacc cccaacggcc 660 
cctccaccag gtagtcccag gcgctccgaa ggacacccag acccacctac tgaatctcga 
agctgcagtc aaggcccccc cagtccagcc agccctgacc ccaaggcctg caactggaaa 
aagtacaagt acatcgtgct aaactctcag gcctcccaag cagggagcct ggtcggggag 
agaagttctg gtcaaccttg cccccaagcc aggctcccca gtggagacga ggcctccagc 
agcagcagca gcagcagcag cagcagcagt gaagaaggac ccattcctgg tccccagagc 
aggctctctc caactgctgc cactgtgcag ttcaaatgtg gggctccagc cagtaccccc 
tacctcctca catcccaggc tcaagacacc tctggatcac cctctgaacg ggctcgtcca 
ctacccggga gtgaattttt cagctgccag aactgtgagg ctgtggcagg gtgctcatcg u 4 0 
gggctggact ccttggttcc tggggacgaa gacaaaccct ataagtgtca gctgtgccgg i 20 0 
tcttcgttcc gctacaaggg caaccttgco agtcaccgta cagtgcacac aggggaaaag 1260 
ccttaccact gctcaatctg cggagcccgt tttaaccggc cagcaaacct gaaaacgcac 
agccgcatcc attcgggaga gaagccgtat aagtgtgaga C gtg C ggctc gcgctttgta 



720 
780 
840 
900 
960 
1020 
1080 



1320 
1380 
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1440 
1500 



caggtacgga gccagcctcc aagtggcttc caaggcaaac c tgcaagagg tggggtgggc " 
caaaagggag ggttctgttc ctcccagagg caggacttga agtctcctcc ctcccaggtg 

gcacatctgc gggcgcacgt gctgatccac accgggg aga agccctaccc ttgccctacc Z7o 

tgcggaaccc gcttccgcca cctgcagacc ctcaagagcc acgttcgcat ccacaccgga 1620 

3 agaagcctt accactgcga cccctgtggc ctgcatttcc ggcacaagag tcaactgcgg 1680 

ctgcatctgc gccagaaaca cggagctgct accaacacca aagtgcacta ccacattctc 1740 

ggggggccc 

1749 



Table 4 hBAZF polypeptide sequence (SEQ ID NO:4) 



T ^ Pr ° f 3 "'"^W a Leu Gly Tyr Val l^o^ 

10 15 
Phe Thr Arg His fear Ser Asp Val Leu Gly Asn Leu Asn Glu Leu Arg 

25 30 
Leu Arg Gly He Leu Thr Asp Val Thr Leu Leu Val Gly Gly Gin Pro 

40 45 
Leu Arg Ala His Lys Ala Val Leu He Ala Cys Ser Gly Phe Phe Tyr 



55 60 



Ser lie Phe Arg Gly Arg Ala Gly Val Gly Val Asp Val Leu Ser Leu 



75 



80 

Pro Gly Gly Pro Glu Ala Arg Gly Phe Ala Pro Leu Leu Asp Phe Met 

90 95 

Tyr Thr Ser Arg Leu Arg Leu Ser Pro Ala Thr Ala Pro Ala Val Leu 

105 110 

Ala Ala Ala Thr Tyr Leu Gin Met Glu His Val Val Gin Ala Cys His 

120 125 

Arg Phe He Gin Ala Ser Tyr Glu Pro Leu Gly He Ser Leu Arg Pro 

135 140 
Leu Glu Ala Glu Pro Pro Thr Pro Pro Thr Ala Pro Pro Pro Gly Ser 
150 155 160 

Pro Arg Arg Ser Glu Gly His Pro Asp Pro Pro Thr Glu Ser Arg Ser 
165 17 ° 175 

Cys Ser Gin Gly Pro Pro Ser Pro Ala Ser Pro Asp Pro Lys Ala Cys 

185 lgo 

Asn Trp Lys Lys Tyr Lys Tyr He Val Leu Asn Ser Gin Ala Ser Gin 
135 200 205 

Ala Gly ser Leu Val Gly Glu Arg Ser Ser Gly Gin Pro Cys Pro Gin 



215 



220 
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225 



235 



Ser Ser Ser ser Ser Glu Glu Oly Pro Ile Pro Gly PrQ 



240 



250 



Gin Ser Arg 



255 



Leu Ser Pro Thr Ala Ala Thr Val rin r 

260 31 JJJ Phe L ^ s ^ Gly Ala Pro Ala 



Ser Thr Pro Tyr Leu Leu Thr Ser Gin Ala 

*5 / 5 



280 



Pro Ser Glu Arg Ala Arg Pro Leu Pro Gly 



295 



270 

Gin Asp Thr Ser Gly Ser 
285 

Ser Glu Phe Phe Ser Cys 
300 



«J «- ay, 01n Ma v, ; M , Gly ^ Sm sm ^ ^ ^ ^ ^ 

315 320 
v.! Gly asp s „ p Lys p „ ^ ^ ^ ^ ^ ^ ^ 



Ser Phe Arg Tyr Lys Gly Asn Leu Ala 



335 



340 



345 



Ser His Arg Thr Val His Thr 



350 



«y «» «. p» ^ Ms ^ Ile ^ My Ma ^ ^ ^ 

Jt>U 365 



Pro Ala Asn Leu Lys Thr His 



370 



375 



Ser Arg lie His Ser Gly Glu Lys Pro 



Lys cys Glu Thr Cys Gly Ser Arg Phe 



390 



380 



Val Gin Val Arg Ser Gin 
395 400 



*» *. - My jj. eia Gly Lys p „ ^ Giy oiy vsi My oin 

0 415 
«T =1, ». Sor s « oln Qla ^ ^ ^ ^ ^ 

" 5 430 

Ser Gin Val Ala His Leu Arg Ala His u a l r« Tn 

435 9 JJ* Hls Val L eu He His Thr Gly Glu 

** U 445 

- J.. ^ «. ^ Pro ^ 01y ^ ^ phe ^ ^ 

3:5 460 
* - *s Ser Hls jjj ^ Ue Hls ^ Gly ^ ^ ^ Ms 



475 



480 



«*. s Leu Hls phe ^ Lya ser cin Leu ^ ^ 

«. u, *, J. tys Hi6 0ly Ma ^ ^ lys vai 2 ^ 

510 

His lie Leu Gly Gly Pro 
515 



WO 01/70808 



-14- 



PCT/US01/09609 



Table 5 



tctttttcct 
tttgaccgct 
cgcgtgctct 
cggaagggcc 
atggtcttca 
cattgattct 
aaagatgcat 
agagagacaa 
caatattaac 
cctgagagtg 
gaccatgact 
caaattggac 
aaatcataat 
tgtagatctt 
atatggtgag 
tgaatgtgtt 
ctcgatttct 
tgtatttttg 
agaatacctc 
ctcaaaagag 
aggcctggct 
tcagggagag 
gttgcaacgg 
tgccggagac 
caaagccaac 
agcaatgaag 
tacaagagaa 
atctggaatg 
ctgtccttgt 
cccgtttgac 
aggtgtcctg 
attcggatca 
ctgcgagaag 
tggagcacac 
tcttaaacaa 
agttgtagct 



cgcgtccttt 
tcccggtgcg 
gcgcttgcca 
cccgcctccc 
tcaggggtga 
gggaaaacta 
gaggtgaaag 
agaggaatca 
attatagata 
ttggatggtg 
gtcaatcgtc 
cgaatgggct 
acagcgttta 
attgaggaac 
attccagctg 
gccaattcag 
gatttaaagc 
ggaagcgcct 
ccaaatccat 
aaaaccaaaa 
tttcccctgg 
ctaaagaagg 
ctggctcgca 
atctgtgcat 
agcggccttt 
ccttctaaca 
gatcccacat 
ggagaattac 
atcacaggaa 
tttacacata 
gagcctctgg 
aatattccaa 
ggccctcttt 
cacatggttg 
gccttggcaa 
ccaaatgaat 



hEF-G nucleotide sequence (SEQ ID NO:5) 



yccccggaag tgctcttaca acattggct g ccggcgtga c 
ttaccggcag ctgaacccac ccggcgccac gggactttga 
tgagactcct gggagctgca gccgtcgcgg ctctggggcg 
taggctggca gaggaagcag gttaattgga aggcctgccg 
ttcctaatga aaaaatacga aatattggaa tctcagctca 
cattaacaga acgagtcctt tactacactg gcagaattgc 
gtaaagatgg agttggtgct gtcatggatt ccatggaact 
ctattcagtc agcagccact ttcaccatgt ggaaagatgt 
ctcctgggca tgtggacttc acaatagaag tggaaagggc 
cagtccttgt tctctgtgct gttggagggg tacagtgcca 
agatgaagcg ctacaacgfct ccgtttctaa cttttattaa 
ccaacccagc cagggccctg cagcaaatga ggtctaaact 
tgcagatacc catgggtttg gagggtaatt ttaaaggtat 
gagccatcta ttttgatgga gactttagtc agattgttcg 
aattaagggc ggcggccact gaccaccggc aggagctaat 
atgaacagct tggtgagatg tttctggaag aaaaaatccc 
tagcaattcg aagagctact ctgaaaagat catttactcc 
tgaagaacaa aggagttcag cctcttttag atgctgtttt 
ctgaagtcca gaactatgct attctcaata aaaaggatga 
tcctaatgaa ctccagtaga cacaattccc acccatttgt 
aggtaggtcg atttggacaa ttaacttatg ttcgcagtta 
gtgacaccat ctataacaca aggacaagaa agaaagtacg 
tgcatgccga catgatggag gcaagtacag aggaagtata 
tgtttggcat tgactgtgct agtggagaca cattcacaga 
ctatggagtc aattcatgtt cctgatcctg tcatttcaat 
agaacgatct ggaaaaattt tcaaaaggta ttggcaggtt 
ttaaagtata ctttgacact gagaacaaag agacagttat 
acctggaaat ctatgctcag aggctggaaa gagagtatgg 
agccaaaagt tgcctttcga gagaccatta ctgcccctgt 
aaaaacaatc aggtggtgca ggccagtatg gaaaagtaat 
acccagagga ctacactaaa ttggaatttt cagatgaaac 
agcagtttgt gcctgctgta gaaaaggggt ttttagatgc 
ctggtcacaa gctctctggg ctccggtttg tcctgcaaga 
attctaatga aatctctttc atccgagcag gagaaggtgc 
atgcaacatt atgtattctt gaacctatta tggctgtgga 
ttcagggaca agtaattgca ggaattaacc gacgccatgg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 



WO 01/70808 



PCT/US01/09609 



-15 



ggtaatcact 
aaatgatatg 
cacaatggag 
I taagtatttg 
ttgcttactg 
ttccagtgga 
ttctttcttc 
taattctatg 
taaaatataa 
ttatactttt 
acatagagaa 
tgttaacggg 
tatctgttta 
tatatcatga 
tccgtatgtt 
ccaaatttcn 
gtaactgcct 
aagtagaccc 
aatcaaatgt 
ttccttggaa 
tttgtttcca 
tcatccaaaa 
gaaataaaat 



gggcaagatg 
tttggttatt 
tatagcaggt 
gaagctacag 
tgagttgact 
ataaattcag 
aaaagaagcc 
tctatctcaa 
ttattactga 
aagcttagaa 
aagctgaaaa 
ttgtttatca 
agtctcataa 
cacttcttgg 
cttcatagag 
ctctgtttta 
ctgttttagg 
ttgcatatac 
aatttataga 
tttatcgtat 
gaaaaagttg 
atcatctgat 
aattgatttt 



gagttgagga 
ccactgaact 
atcagccatg 
gtcaacttcc 
gactctaatt 
gctgctgaaa 
cttcttgttc 
ctctattgat 
aatatgttta 
tgtatgttca 
aatgcaaaga 
tataataatt 
ctctattttt 
agtgtcatta 
agtgtttttc 
tatatgncag 
agtataagta 
tattcttgtt 
gttagtttca 
attgtattca 
ggctttccca 
ggtatagatg 



ctattttaca 
taggtcatgc 
tttaccabcc 
tgttaaaaaa 
gaatctgcgt 
caagaaattc 
atattcagga 
tggttttata 
atatttaagg 
tttccaaatt 
ataaccacat 
tgttttgtca 
agtttgctga 
atgggcaggc 
agattcttca 
tttatttcag 
ttacttcctt 
tgtgttcatc 
tcaacctaat 
ctgagattat 
agcagttcta 
gatcctagtc 



ctgtatgcag 
acagagggaa 
acacaagaag 
ggaaaagcca 
ggttttgata 
tgagcccagg 
gcttctgtta 
gttcattgaa 
ggaaaagaga 
ttgtatcata 
actttccatc 
tatttgcttt 
agacttgaaa 
ttttctgttg 
ttgggatatt 
tttgtggttt 
gtggtctatt 
ttaatgtttt 
gaatgctagt 
gaagggacaa 
ttacccggtt 
cttttcatta 



atgtccctct 
agggagaata 
acgtcattaa 
agaa ctaa ct 
ctttgatgga 
aagcgggctc 
tattcaaagg 
aatcctcaaa 
ctaatttcag 
agagttttca 
taccttcctt 
cactgtctat 
gtgaatcgca 
aagagtggat 
aaaatattag 
ctgcaaattt 
gtgaagtaaa 
tgtacagcta 
taaatttgaa 
atgttaatct 
cagaattgct 
cctgatggta 



Table 6 hEF-G polypeptide (SEQ ID NO:6) 



Met Arg Leu Leu Gly Ala Ma Ala Val Ala Ala Leu Gly Arg oiy Arg 

10 15 
Ala Pro Ala Ser Leu Gly Trp Gln ^ g hys ^ ^ ^ ^ ^ ^ 

Cys Arg Trp Ser Ser Ser Gly Val He Pro aon m T 

35 Y ™ Glu Lys Ile ^g Asn 

40 45 

Asp ser Gly Lys Thr Thr Leu Thr Glu 
" 60 

Arg Val Leu Tyr Tyr Thr Gly Arg Ile Ma Lys ^ ^ ^ ^ ^ 

75 80 
Oly Lys Asp Gly Val Gly Ala Val Met Asp Ser Met Glu Leu Glu Arg 

90 95 
gin Arg Gly He Thr_Ile Gin Ser Ala Ala ^ . . _ _ 



He Gly He Ser Ala His He 
50 



2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3501 
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100 



105 



110 



Asp Thr Pro Gly His Val Asp Phe Thr 
120 125 



Asp val Asn lie Asn lie He Asp Thr 
115 

He Glu Val Glu Arg Ma , ^ Val Leu ^ ^ ^ ^ ^ ^ 

135 140 
£» Cys Ala val Giy G1 Val Gln ^ flla Thr ^ ^ ^ ^ 



155 



Gin Met Lys Arg Tyr Asn Val Pro Phe Leu 



160 



165 



170 



Thr Phe He Asn Lys Leu 



175 



»» *, M.t «j ser to Pro Ma ^ Ml L „ „ ola ^ ^ 

185 190 
Lys Leu Asn His Asn Thr Ala Phe „ et Gin He Pro Met Gly Leu Glu 

200 205 

Oly Asn Phe Lys G ly He Val Asp Leu He Glu Glu Arg Ala He Tyr 

220 

Phe Asp Giy Asp Phe ser Gin He Val Ar g Tyr Gly Glu He Pro Ala 



235 



Glu Leu Arg Ala Ala Ala Thr Asp His Arg Gin 



240 



245 



250 



Glu Leu He Glu Cys 



255 



Val Ala Asn Ser Asp Glu Gin Leu Gl y Glu Met Phe Leu Glu Glu Lys 

2 65 



270 



He Pro ser He Ser Asp Leu Lys Leu Ala He Arg Arg Ala Thr Leu 



280 



285 



Lys Arg Ser Phe Thr Pro Val Phe Leu Gl y Ser Ala Leu L ys Asn L yS 



300 



Gly Val Gin Pro Leu Leu Asp Ala Val Leu Glu 



310 



315 



Tyr Leu Pro Asn Pro 
320 



Ser Glu val Gin Asn Tyr Ala He Leu Asn Lys L ys Asp Asp Ser L ys 



330 



Glu Lys Thr Lys He Leu Met Asn Ser Ser Arg Hi 

340 - 



335 



345 



s Asn Ser His Pro 
350 



- v,i Ma Ph6 Pro G11J yal 01y ^ ^ ^ 



365 



Thr Tyr Val Arg Ser Tyr Gin Gly Glu Leu Lys Lys Gly 



380 



Asp Thr He 



Tyr Asn Thr Arg Thr Arg Lys Lys Val Arg Leu 



390 



395 



Gin Arg Leu Ala Arg 



400 



Met His Ala Asp Met Met Glu Ala Ser Thr Glu Glu 
405 



Val Tyr Ala Gly 
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Asp lie Cys Ma Leu Phe Gly lie Asp Cys Ala Ser Gly Asp Thr Phe 



425 



430 



Thr Asp Lys Ala Asn Ser Gly Leu Ser Met Glu Ser He His Val Pro 
435 440 445 

Asp Pro val lie Ser lie Ala Met Lys Pro Ser Asn Lys Asn Asp Leu 

455 

Glu Lys Phe Ser Lys Gly lie Gly Arg Phe Thr Arg Glu Asp Pro Thr 
465 470 475 480 

Phe Lys Val Tyr Phe Asp Thr Glu Asn Lys Glu Thr Val He Ser Gly 
485 490 495 1 

Met Gly Glu Leu His Leu Glu lie Tyr Ala Gin Arg Leu Glu Arg Glu 



505 



510 



Tyr Gly Cys Pro Cys lie Thr Gly Lys Pro Lys Val Ala Phe Arg 

515 ~ ~ 3 



520 



Glu 



525 



Thr lie Thr Ala Pro Val Pro Phe Asp Phe Thr His Lys Lys Gin Ser 



535 540 

Gly Gly Ala Gly Gin Tyr Gly Lys Val He Gly Val Leu Glu 

545 



550 



555 



Pro Leu 
560 



Asp Pro Glu Asp Tyr Thr Lys Leu Glu Phe Ser Asp Glu Thr Phe Gly 
565 570 575 

Ser Asn lie Pro Lys Gin Phe Val Pro Ala Val Glu Lys Gly Phe Leu 
580 585 590 

Asp Ala Cys Glu Lys Gly Pro Leu Ser Gly His Lys Leu Ser Gly Leu 
595 600 605 

Arg Phe Val Leu Gin Asp Gly Ala His His Met Val Asp Ser Asn Glu 

615 620 

lie Ser Phe lie Arg Ala Gly Glu Gly Ala Leu Lys Gin Ala Leu Ala 
630 635 640 

Asn Ala Thr Leu Cys lie Leu <51u Pro He Met Ala Val Glu Val Val 

650 6 5 5 

Ala Pro Asn Glu Phe Gin Gly Gin Val He Ala Gly He Asn Arg Arg 
660 ggs 670 

His Gly Val He Thr Gly Gin Asp Gly Val Glu Asp Tyr Phe Thr Leu 
675 680 685 

Tyr Ala Asp Val Pro Leu Asn Asp Met Phe Gly Tyr Ser Thr Glu Leu 



695 



700 



Arg Ser Cys Thr Glu Gly Lys Gly Glu Tyr Thr Met Glu Tyr Ser Arg 

/u:a 710 715 3 



720 
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Tyr Gln Pro Cys ^ p-^i^FGlT^r^ p val i le Asn Iyi r ^- ) 

730 ?35 
Leu „» u , ^ „ «, „„ ^ , ^ lys My Lys iu 

745 



750 



Table 7 



gccgcgggag 
ctccccgcct 
| gccgagctgc 
aagctaattg 
aacgactgtt 
agacgacagg 
agcttgtttg 
aaatatgaag 
aaacttccag 
tcccttggtt 
aacagtgcca 
cttggcgatg 
tcaatatttc 
gcttttgagc 
gaaatggaag 
' atgcaagaaa 
tctggttccg 
atcaaactga 
cttgacttct 
atccttgtca 
gaaggaccca 
tacaaccgga 
atgatcgcca 
ctcaggggca 
acttgtcctc 
atcacacatt 
gtgctgaaga 
tgggcagcaa 
tctgccatct 
gcagactttc 
attacaattg 
acaaaacaat 



caggcggagg 
cggcggagac 
ggcagagcgt 
agccactcga 
tacgggagat 
gtcgatacat 
ttacagagtg 
attactcagg 
ttcatgtcta 
cccagaaagg 
tcagcgtgac 
gatcctataa 

tgggattcct 

tcaagatgca 
aatggatcac 
agcgaaatgg 
gtttagatag 
aaagtgaaag 
catcagctga 
agtgcaatga 
ctacaaatgt 
agatttctgc 
ccacgtcccc 
tccttcatga 
atccagatat 
gcgctgagcc 
atgccaagca 
ggacattgtt 
acaggcaaga 
ggaaacctga 
ataatgtttc 
ttgaaacctg 



hTRG nucleotide sequence (SEQ ID NO:7) 



cygaggcggc gggggcagga ggatgtcgca gccgccgctg 
tcggaagttc acccgggcgc tgagtaagcc gggcacggcg 
gtctgaggtg gtgcgcggct ccgtgctcct ggcaaagcca 
ctatgaaaat gtcatcgtcc agaagaagac tcagatcctg 
gctgctcttc ccttacgatg,actttcagac ggccatcctg 
atgctcaaca gtgcctgcga aggcggaaga ggaagcacag 
catcaaaacc tataactctg actggcatct tgtgaactat 
agagtttcga cagcttccga acaaagtggt caagttggat 
tgaagttgac gaggaggtcg acaaagatga ggatgctgcc 
tgggatcacc aagcatggct ggctgtacaa aggcaacatg 
catgaggtca tttaagagac gatttttcca cctgattcaa 
atttgaattt ttaaaagatc tccaaaagga accaaaagga 
gtatggggtg tcgttcagga acaacaaagt caggcgtttt 
ggacaaaagt agttatctct tggcagcaga cagtgaagtg 
aattctaaat aagatcctcc agctcaactt tgaagctgca 
cgactctcac gaagatgatg aacaaagcaa attggaaggt 
ctacctgccg gaacttgcca agagtgcaag agaagcagaa 
cagagtcaaa cttttttatt tggacccaga tgcccagaag 
gccagaagtg aagtcatttg aagagaagtt tggaaaaagg 
tttatctttc aatttgcaat gctgtgttgc cgaaaatgaa 
tgaacctttc tttgttactc tatccctgtt tgacataaaa 
cgatttccac gtagacctga accatttctc agtgaggcaa 
ggcgctgatg aatggcagtg ggccggaaac ccaatctgcc 
agccgccatg cagtatccga agcagggaat attttcagtc 
atttcttgtg gccagaattg aaaaagtcct tcaggggagc 
atatatgaaa agttcagact cttctaaggt ggcccagaag 
ggcatgccaa agactaggac agtatagaat gccatttgct 
taaggatgca tctggaaatc ttgacaaaaa tgccagattt 
cagcaataag ctatccaatg atgacatgct caagttactt 
gaagatggct aagctcccag tgattttagg caatctagac 
ctcagacttc cctaattatg ttaattcatc atacattccc 
cagtaaaact cccatcacgt ttgaagtgga ggaatttgtg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 | 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
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ccctgcatac 
cctaagtact 
tgcattgaat 
agacctggtg 
aacccagaat 
cacctgttgc 
aagagggatg 
agggtggtga 
ggctaccagg 
ggcaagccac 
catttacata 
aacgaacttg 
ttcttgccca 
gtcgcggtta 
ttggagagcc 
tctgaataca 
tctgccgatt 
ctgatcaaat 
cagagatttc 
cacatcactc 
| gctgtcttca 
| aacaactaca 
[ gaatttctcc 
tttggaaaag 
gagttctgca 
caggagttcc 
cattcttttg 
ctgcctctgt 
cccttccctg 
aatccgctgg 
' ctgggcgcca 
! gtgagaaatg 
! gaaaggaata 
aattccgtgg 
ttcctctaca 
tcaacatctg 
tacatgggga 
ggcagcctgg 



caaaacacac 
tgaaatacga 
tcaaagattc 
ggccagtttt 
tttatgatga 
tcacattctt 
tcgttgaaac 
caagcgagca 
agcttgggat 
tgctgaaaat 
attttttcca 
taaagtacct 
ctatcctaaa 
acgtgactcg 
acttgaggtc 
agacagtgca 
tcctcaccag 
ctatggctca 
ctgcatccta 
agaagtttcg 
tcaagagatg 
ttagctgttt 
gtgtagtgtg 
gcaggattca 
gaaaccactt 
gggaggtccg 
atgacagata 
ttggtctgct 
tgaacgcggg 
tgacgccgca 
tctccggcat 
ctgattcgag 
gtgagaagag 
ttcgctgtga 
tcttaaagag 
aacttatgga 
agcgatacat 
ataactctct 



tcagccttac 
cagtcagaag 
agatgaggaa 
cacaagaagc 
gattaaaata 
ccatgtcagc 
ccaagttggc 
gcacatcccg 
gggcaggcat 
ttccactcat 
gtactgtcag 
taagagtctg 
ccagctgttc 
ggtcattatt 
atatgttaag 
tgaagaactg 
caacaaacta 
gcatttgata 
tcatcatgca 
agataatcca 
tttcaccttc 
tgctcctgga 
caaccatgaa 
aagataccaa 
cttggtggga 
tctgatcgcc 
tgcttcaagg 
gattgaaaac 
catgactgtg 
gaagggaagc 
tgcttctcca 
aggatctctc 
caattccctg 
taaacttgac 
catgtctgat 
tttttttaca 
agccagaaca 
cacttttaac 



accatctaca 
tcttttgcca 
gactctcagc 
gcctttgctg 
gagttgccca 
tgtgacaact 
tactcctggc 
gtctcggcga 
tatggtccgg 
ctggtttcta 
aaaaccgaat 
catgcgatgg 
cgagtcctca 
catgtggttg 
tacgcgtata 
accaaatcca 
ctgaagtact 
gagaactcca 
gtggaaaccg 
gaggcatcta 
atggacaggg 
gacccaaaga 
cattatattc 
gacctccagc 
ctgttactga 
atcagtgtgc 
agccatcagg 
gtccagcgga 
aaggatgaat 
accctggaca 
tatacaacct 
ataagcacag 
gataagcacc 
cagtctgaga 
gatgctttgt 
atatctgaag 
ggaatgatgc 
cacagctatg 



ccaatcacct 
aggctagaaa 
cccttaagtg 
cagttttaca 
ctcagctgca 
caagtaaagg 
ttcccctcct 
accttccttc 
aaattaaatg 
cagtgtatac 
ctggagccca 
aaggccacgt 
ccagagccac 
cccagtgcca 
aggctgagcc 
tgaccacgat 
catggttttt 
aagttaagtt 
ttgtaaatat 
agaacgcgaa 
gctttgtctt 
ccctctttga 
cgttgaactt 
ttgactactc 
gggaggtggg 
tcaagaacct 
caaggatagc 
tcaatgtgag 
ccctggctct 
acagcctgca 
caactccaaa 
attcgggtaa 
aacaaagtag 
ttaagagcct 
ttacatattg 
tctgcctgca 
atgccagatt 
gccactcgga 



ttacgtttat 
tattgcgatt 
catttatggc 
ccatcaccaa 
tgaaaagcac 
aagcacgaag 
gaaagacgga 
gggctatctt 
ggtagatgga 
tcaggatcag 
agccttagga 
gatgatcgcc 
acaggaagaa 
tgaggaagga 
atatgttgcc 
tctcaagcct 
ctttgatgta 
gctgcgaaac 
gctgatgcca 
tcatagcctt 
caagcagatc 
atacaagttt 
accaatgcca 
attaacagat 
gacagccctc 
gctgataaag 
caccctctac 
ggatgtgtca 
accagctgtg 
caaggacctg 
catcaacagt 
cagccttcca 
cacattggga 
actgatgtgt 
gaacaaggct 
ccagttccag 
gcagcagctg 
cgcagatgtt 



1980 
2040 
2100 
2160' 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
. 2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
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ctgcaccagt 
acgctttctc 
cctctcatga 
acggctttaa 
ttctatgaag 
aactccaagc 
aacaactttg 
tctgtcagcc 
ctgtccatca 
tctgatgtga 
aaggagcatg 
tatgccagca 
aaaaatggcg 
gaatatctca 
agcgcctgcc 
gtcattaccc 
gtccatttca 
aaagccgagc 
aagcggaggg 
aaagtgaccg 
ttcttcgggc 
tttgaagatg 
gaaatttctc 
atgatacagg 
caggtgactc 
tttgagagat 
aagaggcagg 
tgcttccctt 
cccatcgagg 
tcctcggccg 
caggtcaatg 
cgatatcctg 
tgcggtcaag 
gaagaaatga 
cagatctgcc 
gccatcagtg 
gt gtga ttac 
tgctttccaa 



cattacttga 
tatttacatt 
aaaaagtttt 
aaaatgtctt 
ggagagcgga 
tgagctccat 
attacactgg 
agctgatagc 
tcaacaactg 
aggacttaac 
agaacgaccc 
cgcccgagct 
atctctcaga 
cacggaaaga 
tgaggaggag 
caaacatcga 
acgaggatgt 
gctacgagct 
attttgagag 
aggtcatgca 
aggcagcgca 
aagatggaaa 
agagactcct 
attctggcaa 
acgtcatccc 
cccacaacat 

# c ggggtgga 

atgtgaagaa 
tggccattga 
aggtggacat 
ctggcccact 
acaataaagt 
ccttagcggt 
aagccaacta 
ccctggagga 
ggactccaac 
atctcatggc 
agccaatcac 



agccaacatt 
ggcgtttaag 
tgatgtctac 
cactgcctta 
catgtgtgcg 
caggacggag 
aaagaagtcc 
agacgttgtt 
tgccaacagt 
caaaaggata 
agagatgctg 
caggaagacg 
ggcagcaatg 
agcagtccag 
ccggggaggc 
cgaggaggcc 
gctgatggag 
cattgccgac 
gctggcccat 
ctcgggccgc 
ataccagttt 
ggagtatatt 
taaactgtac 
ggtcaaccct 
cttctttgac 
ccgccgcttc 
agagcagtgc 
gcgcatccct 
cgagatgagt 
gatcaaactg 
agcatatgcg 
gaagctgctt 
aaacgaacgt 
cagggaaatg 
gaagacgagc 
aagcacaatg 
ccgtgtgtgg 

fc ggggagacc 



gctactgagg 
aaccagctcc 
ctgtgttttc 
aggtccttaa 
gctctgtgtt 
gcctcccagc 
tttgtccgga 
ggcattgggg 
gaccggctta 
cgcacggtgc 
gtggacctcc 
tggctcgaca 
tgctatgtcc 
tgggagccgc 
gtgtttagac 
tccatgatgg 
ctccttgagc 
atctacaaac 
ctgtatgaca 
aggcttctgg 
acagacagtg 
tacaaggaac 
tcggataaat 
aaggatctgg 
gaaaaagagt 
atgtttgaga 
aaacggcgca 
gtcatgtacc 
aagaaggtgg 
cagctcaaac 
cgagctttct 
aaggaagttt 
ctgattaaag 
gcgaaggagc 
gtcttaccga 
gttcacggga 
ggacttgctt 
gagcacaggg 



tttgcctgac 
tggccgacca 
ttcaaaaaca 
tttataagtt 
acgagattct 
tgctctactt 
cacatttgca 
gaaccagatt 
ttaagcacac 
taatggccac 
agtacagcct 
gcatggccag 
acgtaacagc 
cccttctccc 
aaggatgcac 
aagacgtggg 
agtgcgcaga 
ttatcatccc 
cgctgcaccg 
ggacctactt 
aaacagatgt 
ccaaactcac 
ttggttctga 
attctaagta 
tgcaagaaag 
tgccatttac 
ccatcctgac 
agcaccacac 
cggagctccg 
tccagggcag 
tagatgatac 
tcaggcaatt 
aagaccagct 
tttctgaaat 
attcccttca 
tgaccagctc 
tgtcatttgc 
aggaccaagg 



agctctggac 
tggacataat 
tcagtctgaa 
tccctcaaca 
caagtgctgt 
cctgatgagg 
agtcatcata 
ccagcagtcc 
cagcttctcc 
cgcccagatg 
ggccaaatcc 
gatccatgtc 
cctagtggca 
ccacagccat 
cgccttcagg 
gatgcaggat 
tggactctgg 
catttatgag 
ggcctacagc 
ccgggtagcc 
ggagggattc 
accgctgtcg 
aaatgtcaaa 
tgcctacatc 
gaaaacagag 
gcagaccggg 
agccatacac 
tgacctgaac 
gcagctgtgc 
cgtgagtgtt 
aaacacaaag 

tgtggaagct 

cgagtatcag 
catgcatgag 
catcttcaac 
gtcttcggtc 
aaactcagga 
ggaaggggag 



4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
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atacansl-af- f-t-*-t-4-i 



gtgcacatat tttttUut ctcact gg ca atattcaaag ttttcattgt gtcttaacaa 
aggtgtggta gacactcttg agctggactt agattttatt cttccttgca gagtagtgtt 
agaatagatg gcctacagaa aaaaaaggtt ctgggatcta catggcaggg ^ 
tgac tt t gcctggggga ccttttgccfc cgaggctgag ^^^^ ttgaaaafcat 

tttttttttc ctgtggcaca ttcaggttga atacaagaac tatttttgtg actagttttt 
gatgacctaa gggaactgac cattgtaatt tttgtaccag tgaaccagga gatttagtgc 
ttttatattc atttccttgc atttaagaaa atatgaaagc ttaaggaatt atgtgagctt 
aaaactagtc aagcagttta gaaccaaagg cctatattaa taaccgcaac tatgctgaaa 
agtacaaagt agtacagtat attgttatgt acatatcatt gttaatacag tcctggcatt 
ctgtacatat atgtattaca tttctacatt tttaatactc acatgggctt atgcattaag 
tttaattgtg ataaatttgt gctgttccag tatatgcaat acactttaat gttttattct 
tgtacataaa aatgtgcaat atggagatgt atacagtctt tactatatta ggtttataaa 
cagttt taag aatttcatcc ttttgccaaa atggtggagt atgtaattgg taaatcataa 
atcctgtggt gaatggtggt gtactttaaa gctgtcacca tgttatattt tcttttaaga 
ctttaattta gtaattttat atttgggaaa ataaaggttt ttaattttat ttaactggaa 

c i: t T CCt 9Ct9taatta ^^"^ ™* tat — -00 

750G 



Table 8 hTRG polypeptide sequence (SEQ ID NO:8) 



t fc ser n pr ° r l - ^^--^^^ 

10 15 

Thr Arg Ala Leu Ser Lys Pro Gly Thr Ala Ala Glu Leu Arg Gin Ser 

25 30 
Val ser Glu Val Val Arg Gly Ser Val Leu Leu Ala Lys Pro Lys Leu 

40 45 

He Glu Pro Leu Asp Tyr Glu Asn Val lie Val ri« t 

50 C c 1 Gln h y s L y s Thr Gin 

55 60 

lie Leu Asn Asp Cys Leu Arg Glu Met Leu Leu Phe Pro ^ ^ ^ 

75 80 
Phe Gin Thr Ma ^ e Leu ^ ^ Qln Qiy ^ ^ ne ^ g ^ ^ 

90 95 
Val Pro Ala Lys Ala Glu Glu Glu Ala Gin Ser Leu Phe Val Thr Glu 

105 110 
cys He g. Thr ^ _ ser ^ „ is ^ ^ ^ 

"° 125 
Olu Asp Tyr Ser Gly Glu Phe Arg Gin Leu Pro Asn Lys Val Val Lys 

135 140 



6540 
6600 
6660 
6720 
6780 
6840. 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
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Leu Asp Lys Leu Pro Val His Val Tyr Glu Val Asp Glu Glu 



150 



155 



Lys Asp Glu Asp Ala Ala Ser Leu Gly Ser Gin Lys Gly Gly 



170 



Val Asp 
160 

He Thr 
175 



Lys His Gly Trp Leu Tyr Lys Gly Asn Met Asn Ser 



180 



185 



Ala He Ser Val 
190 



Thr Met Arg Ser Phe Lys Arg Arg Phe Phe His Leu lie Gin Leu Gly 



205 



Asp Gly ser Tyr Lys Phe Glu Phe Leu Lys Asp Leu Gin Lys Glu 



215 



220 



Pro 



Lys Gly ser lie Phe Leu Gly Phe Leu Tyr Gly Val Ser Phe 



230 



235 



Arg Asn 
240 



Asn Lys Val Arg Arg Phe Ala Phe Glu Leu Lys Met Gin 



245 



250 



Asp Lys Ser 
255 



Ser Tyr Leu Leu Ala Ala Asp Ser Glu Val Glu Met Glu Glu Trp lie 



265 



270 



Thr lie Leu Asn Lys He Leu Gin Leu Asn Phe Glu Ala Ala Met Gin 



280 



285 



Glu Lys Arg Asn Gly Asp Ser His Glu Asp Asp Glu Gin 



295 



300 



Glu Gly Ser Gly Ser Gly Leu Asp Ser Tyr Leu Pro Glu 



310 



315 



Ser Lys Leu 



Leu Ala Lys 
320 



Ser Ala Arg Glu Ala Glu lie Lys Leu Lys Ser Glu ser Arg Val Lys 

330 335 
Leu Phe Tyr Leu Asp Pro Asp Ala Gin Lys Leu Asp Phe Ser Ser Ala 

345 350 

Glu Pro Glu val Lys Ser Phe Glu Glu Lys Phe Gly Lys Arg lie Leu 

360 365 

Val Lys cys Asn Asp Leu Ser Phe Asn Leu Gin Cys Cys Val Ala Glu 

375 380 
Asn Glu Glu Gly Pro Thr Thr Asn Val Glu Pro Phe Phe Val Thr Leu 



395 



400 



Ser Leu Phe Asp He Lys Tyr Asn Arg Lys He Ser Ala 



405 



410 



Asp Phe His 
415 



Val Asp Leu Asn His Phe Ser Val Arg Gin Met He 



420 



425 



Ala Thr Thr Ser 
430 



Pro Ala Leu Met Asn Gly Ser Gly Pro Glu Thr Gin Ser Ala Leu Arg 



445 
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Phe 



Gly Hi ^ H1S Glu ^ ««t Gin Tyr Pro Lys Gl^T^ 

455 460 

Ser Val Thr Cys Pro His Pro Asp He Phe Leu Val Ala Arg He Glu 



475 



Lys Val Leu Gin Gly Ser He Thr His Cys Ala Glu Pro 



485 



490 



480 



Tyr Met Lys 



495 



Ser Ser Asp Ser Ser Lys Val Ala Gin Lys Val Leu Lys Asn Ala 



505 



510 



Lys 



Gin Ala Cys Gin Arg Leu Gly Gin Tyr Arg Met Pro Phe Ala Trp Ala 



520 



525 



Ala Arg Thr Leu Phe Lys Asp Ala Ser Gly Asn Leu Asp Lys 

535 540 

Arg Phe Ser Ala He Tyr Arg Gin Asp Ser 



Asn Ala 



545 



550 



Asp Ser Asn Lys Leu Ser Asn Asp 



555 



560 



Asp Met Leu Lys Leu Leu Ala Asp Phe Arg Lys Pro Glu Lys Met Ala 

boo 



570 



575 



Lys Leu Pro Val lie Leu Gly Asn Leu Asp He Thr He Asp 



585 



Asn Val 



590 



Ser Ser Asp Phe Pro Asn Tyr Val Asn Ser Ser Tyr lie Pro Thr Lys 



605 



Gin Phe Glu Thr Cys Ser Lys Thr Pro lie Thr. Phe Glu Val Glu Glu 



615 



620 



Phe Val Pro Cys lie Pro Lys His Thr Gin Pro Tyr Thr 



630 



635 



Asn His Leu Tyr Val Tyr Pro Lys Tyr Leu Lys Tyr Asp 
645 



650 



He Tyr- Thr 
640 

Ser Gin Lys 
655 



Ser Phe Ala Lys Ala Arg Asn He Ala lie Cys lie Glu Phe Lys Asp 

665 67Q 

Ser Asp Glu Glu Asp Ser Gin Pro Leu Lys Cys He Tyr Gly Arg 



680 



Pro 



685 



Gly Gly Pro val Phe Thr Arg Ser Ala Phe Ala Ala Val Leu His His 



695 



700 



His Gin Asn Pro Glu Phe iyr Asp Glu He Lys He Glu 



710 



715 



Leu Pro Thr 
720 



Gin Leu His Glu Lys His His Leu Leu Leu Thr Phe 



725 



730 



Phe His Val Ser 
735 



Cys Asp Asn ser Ser Lys Gly Ser Thr Lys Lys Arg Asp Val Val Glu 

745 750 
Thr Gin Val Gly Tyr Ser Tro Le u Pro Leu Leu Lv» a , p Gly Arq Val 
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755 



760 



765 



Val Thr Ser Glu Gin His lie Pro Val Ser Ala Asn Leu Pro Ser Gly 

775 780 

Tyr Leu Gly Tyr Gin Glu Leu Gly Met Gly Arg His Tyr Gly Pro Glu 
790 800 

He Lys Trp Val Asp Gly Gly Lys Pro Leu Leu Lys lie Ser Thr His 
805 810 815 

1 Leu Val Ser Thr Val Tyr Thr Gin Asp Gin His Leu His Asn Phe Phe 

• 825 830 

Gin Tyr Cys Gin Lys Thr Glu Ser Gly Ala Gin Ala Leu Gly Asn Glu 

Leu Val Lys Tyr Leu Lys Ser Leu His Ala Met Glu Gly His Val Met 

855 860 

lie Ala Phe Leu Pro Thr He Leu Asn Gin Leu Phe Arg Val Leu Thr 

y/5 880 
Arg Ala Thr Gin Glu Glu Val Ala Val Asn Val Thr Arg Val lie lie 
885 830 895 

His Val Val Ala Gin Cys His Glu Glu Gly Leu Glu Ser His Leu Arg 

905 910 

Ser Tyr Val Lys Tyr Ala Tyr Lys Ala Glu Pro Tyr Val Ala Ser Glu 

920 925 

Tyr Lys Thr Val His Glu Glu Leu Thr Lys Ser Met Thr Thr lie Leu 

935 94Q 

Lys pro Ser Ala Asp Phe Leu Thr Ser Asn Lys Leu Leu Lys Tyr Ser 
950 9 " 960 

Trp Phe Phe Phe Asp Val Leu He Lys Ser Met Ala Gin His Leu He 
965 970 S75 

| Glu Asn Ser Lys Val Lys Leu Leu Arg Asn Gin Arg Phe Pro Ala Ser 
| 985 990 

Tyr His His Ala Val Glu Thr Val Val Asn Met Leu Met Pro His 



995 



1000 



1005 



Thr Gin Lys Phe Arg Asp Asn Pro Glu Ala Ser Lys Asn Ala Asn 

1015 1020 

His Ser Leu Ala Val Phe lie Lys Arg Cys Phe Thr Phe Met Asp 

1030 1035 

Arg Gly Phe Val Phe Lys Gin tie Asn Asn Tyr He Ser Cys Phe 

1045 1050 

Wpro G iy Asp Pro Lvs Thr- LeU Phe Glu Tvr Lys Phe Glu Phe 
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1055 



1060 



1065 



Leu jrg val Val Cys Asn His Glu His Tyr lie Pro Leu Asn Leu 

1075 1080 

Pro Met Pro Phe oly Lys Gly Arg xie am Arg Tyr Gln Asp Leu 

1090 1095 

Gin £» Asp Tyr Ser Leu Thr Asp Glu Phe Cys Arg Asn His Phe 

1105 1110 

Leu Val__ Gly Leu Leu Leu Arg Glu Val Gly Thr Ala Leu Gin Glu 

1120 1125 

Phe Ar^ Glu Val Arg Leu lie Ala lie Ser Val Leu Lys Asn Leu 

1135 1140 
Leu lie Lys His ser Ph6 ftsp ^ ^ Ma ^ 

1150 1155 

Gin Al. Arg lie Ala Thr Leu Tyr Leu Pro Leu Phe Gly Leu Leu 

1165 11?0 

He Glu Asn Val Gin Arq He A<?n v a i * „ 

1175 ^ 9 ^ Asn Val Arg Asp Val Ser Pro Phe 

1180 1185 

Pro vji Asn Ala Gly Met Thr Val Lys Asp Glu Ser Leu Ala Leu 

1195 1200 
Pro Ala s val Asn Pro Leu Val Thr Pro Gin Lys Gly Ser Thr Leu 

1210 1215 

Asp So ser Leu His Lys Asp Leu Leu Gly Ala lie Ser 0 ly He 

1225 1230 
Ala ^ Pro ^ Thr ^ Thr ^ ne ^ 

1240 1245 

Asn ji. Asp ser Arg Gly s er Leu lie Ser Thr Asp Ser Gly Asn 

1255 1260 
Ser £u Pro G i u ^ ^ Glu Lye s ^ ^ ^ ^ 

1270 12?5 

I Sao GlD S6r Ser Thr Ss ASn S " Val Val <* s 

1285 129Q 

Lys Leu 5 Asp Gin Ser Glu lie Lys Ser Leu Leu Met Cy S Phe Leu 

1300 1305 

Tyr ne Leu Lys Ser Met Ser Asp Asp Ala Leu Phe Thr Tyr Trp 

1315 1320 

Asn Lys_ Ala Ser Thr Ser Glu Leu Met Asp Phe Phe Thr He Ser 

1330 1335 

Glu Val cys Leu His Gin Phe Gin Tyr Met Gly Lys Arg Tyr He 
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Ala £g Thr Gly Met Met His Ala Arg ^ Gln Gln Leu Qly ^ 

1360 1365 

Leu j.p Asn Ser Leu Thr ^ His ^ ^ ^ ^ ^ 

1375 1380 
Ala Asp s val Leu His Gln Ser Leu Leu Glu Ala Asn lie Ala Thr 

1390 13g5 

I Glu vji cys Leu Thr Ala Leu Asp Thr Leu Ser Leu Phe Thr Leu 

1405 . 1410 

I Ala Ph^ Lys Asn Gln Leu Leu Ala Asp His Gly His Asn Pro Leu 

1420 1425 

[ Met Lys val Phe Asp Val Tyr Leu Cys Phe Leu Gln Lys His 

14J5 144Q 

I JEs G1U Thr Ma LeU *•» Val Phe Thr Ala Leu Arg Ser 

1450 1455 

! I-eu ii. ryr L ys Phe Pro Ser Thr Phe Tyr Glu Gly Arg Ala Asp 

1465 147 q 
Met cj. Ala Ala Leu Cys Tyr Glu lie Leu Lys Cy S ' Cys Asn Ser 

1480 14g5 

Lys Le^ ser Ser He Arg Thr Glu Ala Ser Gln Leu Leu Tyr Phe 

1495 1500 

Leu Met s Arg Asn Asn Phe Asp Tyr Thr Gly Lys Lys Ser Phe Val 

j.bio 1515 

Arg T^ His Leu Gln Val lie Iie Ser Val Ser Gln Leu ^ ^ 

1525 1530 
Asp val Gly Xle Gly Gly Thr Arg Phe Gln Gln Ser Leu Ser 

1540 1545 

Ha lie Asn Asn Cys Ala Asn Ser Asp Arg Leu He Lys His Thr 

1555 1560 
Ser Phe s Ser Ser Asp Val Lys Asp Leu Thr Lys Arg He Arg Thr 

1570 15?5 

Val lju Met Ala Thr Ala Gln Met Lys Glu His Glu Asn Asp Pro 

1585 159Q 

' Olu jjt Leu Val Asp Leu Gln Tyr Ser Leu Ala Lys Ser Tyr Ala 

1600 1605 

Ser Thr Q Pro Glu Leu Arg Lys Thr Trp Leu Asp Ser Met Ala Arg 

1615 1620 

lie a. val Lys Asn Gly Asp Leu Ser Glu Ala Ala Met Cys Tyr 

1630 1635 
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1645 1650 
Ala Va^ Gln Trp Glu Pro Pro Leu Leu Pro His Ser His Ser Ala 

1660 1665 
Cys Le^ Arg Arg Ser Arg G1 Gly m pfae ^ ^ ^ ^ 

1675 1680 

| Ma 116 ^ So ASP G1U Glu Ala S - *t 

1690 1695 

Met «l« Asp val Gly „ et Asp yal ^ ^ 

1705 1710 

Leu Me^ Glu Leu Leu Glu ^ Ma ^ ^ ^ ^ Ma 

1720 1725 
! GlU Tyr Glu Leu ne ^ ne ^ ^ ^ ne ne ^ 

1735 1740 
He Ty^ Glu Lys Arg Arg Asp Phe Glu Arg Leu Ala His Leu Tyr 

1750 1755 

Asp T.r Q Leu His Arg Ala Tyr Ser Lys Val Thr Glu Val Met His 

1765 , 177Q 

Ser Gl^ Arg Arg Leu Leu Gly Thr Tyr Phe Arg Val Ala Phe Phe 

1780 1785 

Gly ai» o Ala Ala Gin Tyr Gin Phe Thr Asp Ser Glu Thr Asp Val 

1795 1800 

«. a* P te Phe 01u „ ^ My lya ^ ^ 

1810 1815 

Glu Pro Lys Leu Thr Pro Leu Ser Glu He s^r n« » t 

1820 lfl „ ±u Iie Ser Gln Arg Leu Leu 

1825 1830 

Lys lju Tyr Ser Asp Lys phe Gly Ser Glu Asn Val Lys Met xie 

1840 1845 

GlD lllo Gly Lys Val JSs Pr ° LyS Leu ser Tyr 

1855 1860 

| m, n. Gla val Ite m u> pro phe mu 

1870 1875 

I Glu So Gln Glu Lys s s Glu phe Glu ser His *« «• 

1885 1890 
| Arg Ar^ Phe Met Phe Glu Met Pro Phe Thr Gln Thr Gly Lys Arg 

1900 19Q5 

Gln Gl^ Gly Val Glu Glu Gln Cys Lys Arg Arg Thr He Leu Thr 

1915 1920 
^ne_ His cys Phe Pro jry ^gi Lys Lys Arg He Pro Val Met 



WO 01/70808 



PCT/US01/09609 



28 



1925 1930 



1935 



^ il*o HiS HlS Tbr ASP Leu Asn Pro Ile Glu Val Ala Ile 

1945 1950 
Glu Met^ Ser Lys Lys Val Ala Glu Leu Arg Gin Leu Cys Ser Ser 



1965 



1970 ^ ^ L6U Gln L6U L ^ S Leu Gin Gly Ser 

1975 i 980 

Val Ser Val Gin Val Asn Ala Gly Pro Leu Ala Tyr Ala Arg Ala 
1985 1990 1995 3 

Phe Le^ Asp Asp Thr Asn Th^ Lys Arg Tyr Pro As^ Asn Lys Val 

Lys Leu Leu Lys Glu Val Phe Arg Gin Phe Val Glu Ala Cys Gly 

2020 2025 

GlQ 2^30 Ma ASn G1U Leu Ile ^ Glu Asp Gin Leu 

2030 2035 2040 

Glu Tyr_ Gin Glu Glu Met Lys^ Ala Asn Tyr Arg Glu_ Met Ala Lys 

61U 2oL HlS G1U GlD Ile <* s Pr ° Glu Glu 

zuou 2065 — 



2070 



Lys Thr Ser Val Leu Pro Asn Ser Leu His lie Phe Asn Ala" lie 
2075 2080 2085 

^ Thr M6t Val His ^ Met Thr Ser ser 
ZOB0 2095 2100 

Ser Ser Val Val 
2105 



Table 9 hMXl nucleotide sequence (SEQ ID NO:9) 



ttttgtttac agggaacacg ggtctggctg agagaaa atg gccagcattt tccaagtact 
gtaaattcct gtgcagaagg catcgtcgtc ttccggacag actatggtca ggtattcact 
tacaagcaga gcacaattac ccaccagaag gtgactgcta tgcaccccac gaacgaggag 
ggcgtggatg acatggcgtc cttgacagag ctccatggcg gctccatcat gtataactta 
ttccagcggt ataagagaaa tcaaatatgg acctacatcg gctccatcct ggcctctgtg 
aacccctacc agcccatcgc cgggctgtac gagcctgcca ccatggagca gtacagccgg 
cgccacctgg gcgagctgcc cccgcacatc ttcgccatcg ccaacgagtg ctaccgctgc 
ctgtggaagc gccacgacaa ccagtgcatc ctcatcaagg gtgaaagtgg ggcaggtaaa 
accgaaagca ctaaattgat cctcaagttt ctgtcagtca tcagtcaaca gtctttggaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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ttgtccttaa 
atggaagctt 
tttgttcagc 
atcctctctt 
ttttatgcac 
ccagaaaact 
caggaatcct 
gttcgggaag 
actgctggtg 
gggcfcggacc 
gaagagatcc 
atggctctgt 
ggcaatgagg 
gttaatcact 
aacaagcata 
gatattgact 
ctagccctta 
aagctacaca 
aattttggag 
aagaacagag 
tttatctacg 
ggaagcaaac 
ttaatggcaa 
cagaagatgc 
atgctggaga 
ttttacaaaa 
aagtgcacga 
accaaggtat 
gtgagccacg 
tacagaaagg 
aggaggagat 
cagattgctc 
aagaaacagg 
1 9 a 9cgaagag 
! ctcgaagcct 
aaggaaaata 
cgcatgaagg 
c Srgcgggacc 



aggagaagac 
tcggcaatgc 
tgaacatctg 
cccagaaccg 
tgctggcagg 
accactactt 
ttagggaagt 
tgtcgaggct 
gggcacaggt 
caacacagct 
tcacgcctct 
atgcgtgctg 
acttcaagtc 
ttgaacagtt 
ttttttcttt 
ggatagacaa 
tcaatgaaga 
gtcagcatgc 
tgaagcacta 
atacatttcg 
atctttttga 
atcggcggcc 
cgctaagctc 
cagaccagtt 
ctgtgagaat 
ggtataaagt 
gcctgctgca 
ttcttcgaga 
cggccatggt 
tcctttattg 
ttttgcacct 
ggagagttta 
aagaggaaga 
aagccgagct 
tgcagaagag 
agcaggtgga 
agcagcagga 
aggagctccg 



atcctgtgtt 
gaagaccgtg 
tcagaaagga 
agtagtaagg 
gctggaacat 
gaatcagtct 
tattacggca 
gcttgctggt 
ttccttcaaa 
cacagatgct 
caatgttcaa 
ctttgagtgg 
tattggcatc 
caatataaac 
agaacaacta 
tggagaatgc 
aagccatttt 
gaataaccac 
tgctggagag 
agatgacctt 
acatgtttca 
tacagtcagc 
ctctaatcct 
tgaccaggcg 
ccgcaaagct 
gctgatgagg 
gctctatgat 
atccttggaa 
gattcgggcc 
tgtggtgata 
gaaaaaggca 
cagacaattg 
aaagaagaaa 
ccgcgcccag 
ccagaaggaa 
agagatcctc 
gctgtcgctg 
caggctggag 



gaacgagcta 
tacaacaaca 
aatattcagg 
caaaatcccg 
gaagaaagag 
ggatgtgtag 
atggacgtga 
atactgcatc 
acagctttgg 
ttgacccaga 
caggcagtag 
gtaatcaaga 
ctcgacatct 
tatgcaaacg 
gaatatagca 
ctggacttga 
cctcaagcca 
ttttatgtga 
gtgcaatatg 
ctcaatttgc 
agccgcaaca 
tcacagttca 
ttctttgttc 
gttgtgctga 
gggtatgcgg 
aatctggctc 
gcctccaaca 
cagaaactgg 
catgtcttgg 
atacagaaga 
gccatagttt 
ctggcagaga 
c 9ggaggaag 
caggaagaag 
gctgaactga 
cgtctggaga 
accgaggctt 
gaggaagcgt 



ttcttgaaag 
actctagtcg 
gcgggagaat 
gggaaaggaa 
aagaatttta 
aagacaagac 
tgcagttcag 
ttgggaacat 
gcagatctgc 
gatcaatgtt 
acagcaggga 
agatcaacag 
ttggatttga 
agaaacttca 
gggaaggatt 
ttgagaagaa 
cagacagcac 
agcccagagt 
atgtccgagg 
taagagaaag 
accaggatac 
aggttgactc 
gctgtatcaa 
accagctgcg 
tccgaagacc 
tgcctgagga 
gcgagtggca 
agaagcggag 
gcttcttagc 
attacagagc 
tccagaagca 
aaagggagca 
aagaaagaga 
aaacgaggaa 
cccgtgaact 
aagaaatcga 
ccctgcagaa 
gcagggcggc 



cagccccatc 
ctttgggaag 
tgtagattgt 
ttatcacata 
tttatctacg 
aatcagtgac 
caaggaggaa 
agaatttatc 
ggagttactt 
cctcagggga 
ctccctggcc 
caggatcaaa 
aaactttgag 
ggagtacttc 
agtgtgggaa 
acttggcctc 
cttattggag 
tgcagttaac 
tatcttggag 
ccggtttgac 
cttgaaatgt 
actgcattcc 
gccaaacatg 
gtactcaggg 
ctttcaggac 
cgtccgaggg 
gctggggaag 
ggaagaggaa 
acggaaacaa 
attccttctg 
actcagaggt 
agaagaaaag 
aagagagaga 
gcagcaagaa 
ggagaaacag 
ggacctgcag 
gctgcaggag 
ccaggagttc 



600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
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ctcgagtccc 
tcggggggaa 
ttcaacttca 
gacgccttca 
ggcatccgga 
cccaccagcc 
agcctacaca 
ttgccctccc 
acttccggca 
cgctgctctg 
cagtcctcgt 
ctttcatacc 
ctgtacatga 
gaaaccttct 
aaaggggggg 

cgccagtcca 
gaagtgcgaa 
attatggccg 
ttcagcgtgc 
gagcaggcaa 
tgtgcctctg 
ctgcactgca 
tccaaagggg 
gaggtgaaga 
acccacaatt 
ctggtcctca 
ggctactgga 
ctcaacgagg 
atcgacaccc 
gtggtggaac 
tccccgctcc 
tataccaccc 
atgtctgacc 
ctgcgggacg 
agtgtgggca 
agtcgaggga 
ggaaccgaga 
cgagagtttg 



tcaatttcga 
gcgaattttc 
gccagcccta 
aggactcccc 
ccagcgatga 
ccagtgcgga 
actcctccag 
cagacggcga 
gcagcgtgac 

tggggaccta 
ttgaagatag 
ggcgtgactc 
aaggtggcct 
tgtggttccg 
gctcctccac 
agctgatgta 
cggcaaaaga 
ataggacttt 
tgagtcaggt 
acccacagaa 
acagccctga 
acgccgacac 
acaccagagt 
acagtccaaa 
ccctggatta 
acagcctctg 
acgtcaccgt 
ccacccggtg 
ccacccagca 
agatttacaa 
tgccccttcc 
ttcaggatga 
caattccaat 
agctgtactg 
acctgtacag 
ttctcaagta 
tggaaaaata 
tgccttcccg 



cgagatcgac 
cagcgagctg 
cccagaggag 
caaccccagc 
ctcttcagag 
cagcacggtg 
cggcgagtcc 
ctacgactac 
cttctccaac 
caacagctcg 
tgaagaggac 
tgtgtacagc 
gatgaactct 
ctccaagcag 
gctgtccagg 
ctttgaaaac 
gatcatagat 
ccacctgatt 
ccacgcgtcc 
tgctgtgggc 
tagacccaac 
gccggaggag 
ggagggccag 
gatgtcttca 
ctacaagagt 
ctctgtcgtc 
gtacgggcgc 
gtccagtgtc 
gctgattcaa 
gcggaacccg 
gtatggggac 
ggccatcaag 
aatccagggc 
ccagcttatc 
ctggcagatc 
tctcaagttc 
cgctctcttc 
agatgaaata 



gagtgtgtcc 
gctgagagcg 
gaggtcgatg 
gagcacggcc 
gaggacccat 
ctgctcgccc 
acctactgca 
gaccaggatg 
tcctacggca 
ggtgcctacc 
tttgattcca 
tgtgtcactc 
tggaaacgcc 
gaggccctca 
agaaattgga 
gacagcgagg 
aacaccacca 
gcagagtccc 
acggaccagg 
accttggatg 
tcgtttgtga 
atgcaccact 
gaattcatcg 
ctgaaactga 
tcagagaaga 
cccccagatg 
aagcactgtt 
attcaaaacg 
gatatcaagg 
atccttcgat 
ataaatctca 
atattcaatt 
atcctacaga 
aaacagacca 
ctgacatgcc 
catctgaaaa 
acttacgaat 
gaagctctga 



ggaatatcga 
catgcgagga 
agggcttcga 
actcagacca 
acatgaacga 
catcagtgca 
tgccccagaa 
actatgagga 
gccagtggtc 
ggttcagctc 
ggtttgatac 
tgccgtattt 
gctggtgcgt 
agcaaggctg 
agaagcgctg 
agaagctcaa 
aggagaatgg 
cagaagatgc 
agatccagga 
tggggctgat 
tcatcacggc 
ggataaccct 
tgagaggatg 
agaaacggtg 
acgcgctcaa 
agaagatatt 
accggctcta 
tgactgacac 
agaactgcct 
acacccatca 
acttgctgaa 
ccctgcagca 
cagggcatga 
acaaagtgec 
tgagctgcac 
ggatacggga 
ctcttaagaa 
tccacaggca 



gcggtccctg 
gaagcccaac 
a gccgacgac 
gcgaacaagt 
cacggtggtg 
ggactccggg 
cgctggggac 

cggtgccatc 

ccccgactac 
tgagggggcg 
agatgatgag 
ccacagcttt 
cctcaaggat 
gctccacaaa 
gtttgtcctc 
gggcaccgta 
gatcgacatc 
cagccagtgg 
gatgcatgat 
tgattctgtg 
caaccgggtg 
gctgcagagg 
gttgcacaaa 
gtttgtactc 
actggggacc 
caaagagaca 
caccaagctg 
caaggccccg 
gaactcggat 
ccccttgcac 
agacaaaggc 
actggagtcc 
cctgcgacct 
ccaccccggc 
cttcctgccg 
acagtttcca 
aaccaaatgc 
ggaaatgaca 



2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 j 
3660 I 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
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•t^acggtct attgccatgg cggcggc tcc tgcaagatca ccatcaactc ccacaccac c- 

OfThctnrrrrarrrr k- „~.i 



- - -~~v*v,v,^, «-La^auuacc 

gctggggagg tggtggagaa gctgatccga ggcctggcca tggaggacag caggaacatg 
tttgctttgt ttgaatacaa cggccacgtc gacaaagcca ttgaaagtcg aaccgtcgta 
gctgatgtct tagccaagtt tgaaaagctg gctgccacat ccgaggttgg ggacctgcca 
tggaaattct acttcaaact ttactgcttc ctggacacag acaacgtgcc aaaagacagt 
gtggagtttg catttatgtt tgaacaggcc cacgaagcgg ttatccatgg ccaccatcca 
gccccggaag aaaacctcca ggttcttgct gccctgcgac tccagtatct gcagggggat 
tatactctgc acgctgccat cccacctctc gaagaggttt attccctgca gagactcaag 
gcccgcatca gccagtcaac caaaaccttc accccttgtg aacggctgga gaagaggcgg 
acgagcttcc tagaggggac cctgaggcgg agcttccgga caggatccgt ggtccggcag 
aaggtcgagg aggagcagat gctggacatg tggattaagg aagaagtctc ctctgctcga 
gccagtatca ttgacaagtg gaggaaattt cagggaatga accaggaaca ggccatggcc 
aagtacatgg ccttgatcaa ggagtggcct ggctatggct cgacgctgtt tgatgtggag 
tgcaaggaag gtggcttccc tcaggaactc tggttgggtg tcagcgcgga cgccgtctoc 
gtctacaagc gtggagaggg aagaccactg gaagtcttcc agtatgaaca catcctctct 
tttggggcac ccctggcgaa tacgtataag atcgtggtcg atgagaggga gctgctcttt 
gaaaccagtg aggtagtgga tgtggccaag ctcatgaaag cctacatcag catgatcgtg 
aagaagcgct acagcacgac acgctccgcc agcagccagg gcagctccag g 



Table 10 hMXl polypeptide sequence (SEQ ID NO:10) 

JJ M cys Leu Gin Gly ^F^y ,al Up Leu Arg Glu Asn Gly Gin His" 



10 15 



Phe Pro ser Thr Val Asn Ser Cys Ala Glu Gly xie Val Val Phe Arg 

25 30. 
Thr Asp Tyr Gly Gin Val Phe Thr Tyr Lys Gin Ser Thr He Thr His 

40 45 

Gin Lys val Thr Ala Met His Pro Thr Asn Glu Glu Gly Val Asp Asp 

55 60 

Met Ala ser Leu Thr Glu Leu His Gly Gly Ser He Met Tyr Asn Leu 

70 7 5 80 

Phe Gin Arg Tyr Lys Arg Asn Gin lie Trp Thr Tyr He Gly Ser He 

85 90 95 

Leu Ala Ser Val Asn Pro Tyr Gin Pro He Ala Gly Leu Tyr Glu Pro 

105 110 

Ala Thr Met Glu Gin Tyr Ser Arg Arg His Leu Gly Glu Leu Pro Pro 

120 125 



5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6171 
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His s; ™ M IIe Ma Glu-^^^^r^^ 

135 140 
His Asp Asn Gin Cys lie Leu He Lys Gly Glu Ser Gly Wa Gly Lys 



155 



160 



Thr Glu Ser Thr Lys Leu lie Leu Lys Phe Leu Ser Val Il e Ser Gin 

±/U 175 

Gk SI L6U S6r LeU ^ I*- ^ Ser Cys Val Glu Arg 

185 190 

Ala lie Leu Glu Ser Ser Pro lie Met Glu Ala Phe Gly Asn Ma Lys 

^uo 205 

Thr val Tyr Asn Asn Asn Ser Ser Arg Phe Gly Lys Phe Val Gin Leu 

220 

Asn He Cys Gin Lys Gly Asn lie Gin Gly Gly Arg He Val Asp Cys 

235 240 
He Leu ser Ser Gin Asn Arg Val Val Arg Gin Asn Pro Gly Glu Arg 

250 255 
Asn Tyr His lie Phe Tyr Ma Leu Leu Ala Gly Leu Glu His Glu Glu 

265 270 
Arg Glu Glu Phe Tyr Leu Ser Thr Pro Glu Asn Tyr His Tyr Leu Asn 

<*B0 285 

Oln Ser Gly Cys Val Glu Asp Lys T*r He Ser Asp Gin Glu Ser Phe 

295 300 

Arg Glu val He Thr Ala Met Asp Val Met Gin Phe Ser Lys Glu Glu 

315 

Val Arg Glu Val Ser Axg Leu Leu Ala Gly He Leu His Leu Gly Z 

330 335 

He Glu Phe lie Thr Ala Gly Gly Ala Gin Val Ser Phe Lys Thr Ala 

345 350 

Leu Gly Arg Ser Ala Glu Leu Leu Gly Leu Asp Pro Thr Gin Leu Thr 

360 365 

Asp Ala Leu Thr Gin Arg Ser Met Phe Leu Arg Gly Glu Glu He Leu 

375 380 
Thr Pro Leu Asn Val Gin Gin Ala Val Asp Ser Arg Asp Ser Leu Ala 



395 



400 



Met Ala Leu Tyr Ala Cys Cys Phe G i u Trp Val Ile 



405 



410 



Lys Lys lie Asn 
415 



Ser Arg He Lys Gly Asn Glu Asp Phe Lys Ser He Gly He Leu Asp 

425 430 
^^-gil^Glu^sn^e Glu Val Asn His ph „ mn m , _ ^ ^ 
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435 



440 



445 



He Asn Tyr Ala Asn Glu Lys Leu Gin Glu Tyr Phe Asn Lys His Xl e 

455 460 
Phe ser Leu Glu Gin Leu Glu Tyr Ser Arg olu Gly Leu ^ Trp ^ 

* /:> 480 
Asp lie Asp Trp ii e ^ Asn Qly ^ ^ ^ ^ ^ ^ ^ ^ 

490 495 
Lys Leu Gly Leu Leu Ala Leu lie Asn Glu Glu Ser His Phe Pro Gin 



505 



510 



Ala Thr Asp Ser Thr Leu Leu Glu Lys Leu His 



515 



520 



Ser Gin His Ala Asn 
525 



Asn His Phe Tyr val Lys Pro Arg Val Ala Val Asn Asn Phe Gly val 

535 540 

Lys His Tyr Ala Gly Glu Val Gin Tyr Asp Val Ar 9 Gly Iie Leu Glu 

555 560 

Lys Asn Arg Asp Thr Phe Arg Asp Asp Leu Leu Asn Leu Leu Arg Glu 

570 575 

Ser Arg Phe Asp Phe lie Tyr Asp Leu Phe Glu His Val Ser Ser Arg 

585 590 

Asn Asn Gin Asp Thr Leu Lys C^s Gly Ser Lvs Hi* a™ * t> 

595 An Y Hls Arg Arg Pro Thr 

600 605 

val ser Ser Gin Phe Lys Val Asp Ser Leu His Ser Leu Met Ala Thr 

615 620 
Leu Ser Ser Ser Asn Pro Phe Phe Val Arg Cys lie Lys Pro Asn Met 

635 640 
Gin Lys Met Pro Asp Gin Phe Asp Gin Ala Va! Val Leu Asn Gin Leu 

650 655 
Arg Tyr Ser Gly Met Leu Glu Thr Val Arg lie Arg Lys Ala Gly Tyr 

665 670 
Ala val Arg Arg Pro Phe Gin Asp Phe Tyr Lys Arg Tyr Lys Val Leu 

O 80 



685 



Met Arg Asn Leu Ala Leu Pro Glu Asp Val Arg Gly 



695 



700 



Lys Cys Thr Ser 



Leu Leu Gin Leu Tyr Asp Ala Ser Asn Ser Glu Trp Gin Leu Gly L ys 

715 72Q 
Thr Lys Val Phe Leu Arg Glu Ser Leu Glu Gin Lys Leu Glu Lys Arg 



Arg Glu Glu Glu Val Ser His Ala Ala Met 



740 



745 



735 

Val He Arg Ala His Val 
750 
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Leu Gly Phe Leu Ala Arg Lys Gin Tyr Arg Lys Val Leu Tyr Cys Val 
755 76 ° 765 

Val lie lie Gin Lys Asn Tyr Arg Ala Phe Leu Leu Arg Arg Arg Phe 

775 780 

Leu His Leu Lys Lys Ala Ala lie Val Phe Gin Lys Gin Leu Arg Gly 
790 795 800 

Gin lie Ala Arg Arg Val Tyr Arg Gin Leu Leu Ala Glu Lys Arg Glu 

805 810 815 

Gin Glu Glu Lys Lys Lys Gin Glu Glu Glu Glu Lys Lys Lys Arg Glu 

° Z{J 825 



830 



Glu Glu Glu Arg Glu Arg Glu Arg Glu Arg Arg Glu Ala Glu 



835 



840 



845 



Leu Arg 



Ala Gin Gin Glu Glu Glu Thr Arg Lys Gin Gin Glu Leu Glu Ala Leu 

855 860 

Gin Lys Ser Gin Lys Glu Ala Glu Leu Thr Arg Glu Leu Glu Lys Gin 



870 



875 



880 



Lys Glu Asn Lys Gin Val Glu Glu lie Leu Arg Leu Glu Lys Glu lie 



885 



890 



895 



Glu Asp Leu Gin Arg Met Lys Glu Gin Gin Glu Leu Ser Leu Thr Glu 



905 



910 



Ala Ser Leu Gin Lys Leu Gin Glu Arg Arg Asp Gin Glu Leu 



915 



920 



925 



Arg Arg 



Leu Glu Glu Glu Ala Cys Arg Ala Ala Gin Glu Phe Leu Glu Ser Leu 



940 



Asn Phe Asp Glu He Asp Glu Cys Val Arg Asn He Glu Arg 
950 



955 



Ser Leu 
960 

Ser Gly Gly Ser Glu Phe Ser Ser Glu Leu Ala Glu Ser Ala Cys Glu 
965 970 975 

Glu Lys Pro Asn Phe Asn Phe Ser Gin Pro Tyr Pro Glu Glu Glu Val 
980 985 

Asp Glu Gly Phe Glu Ala Asp Asp Asp Ala Phe Lys Asp Ser Pro Asn 
995 1000 1005 

Pro Ser Glu His Gly His Ser Asp Gin Arg Thr Ser Gly He Arg 

1015 1020 

Thr Ser Asp Asp Ser Ser Glu Glu Asp Pro Tyr Met Asn Asp Thr 
1025 1030 1035 

IXo ^ ? 6r - Ma Ser ^ Val Leu Ala 



1045 



1050 
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Pr ° ;g VluiBAapserOly Ser Leu H xs Asn Ser Ser Ser Gly 

1060 1065 

Glu Ser Xhr Tyr Cys Met Pro Gin Asn Ala Gly Asp Leu Pro Ser 

i075 1080 

| Pro Asp Gly Asp Tyr Asp Tyr Asp Gin Asp Asp Tyr Glu Asp Gly 

1090 1095 

! Ala lie Thr Ser Gly Ser Ser Va! Thr Phe Ser Asn Ser Tyr Gly 

1105 111Q 

Ser Gin Trp Ser Pro Asp Tyr Arg Cys Ser Val Gly Thr Tyr Asn 

1120 1125 

Ser Ser Gly Ala Tyr Arg Phe Ser Ser Glu Gly Ala Gln Ser Ser 

1135 1140 

Phe Glu Asp ser Glu Glu Asp Phe Asp Ser Arg Phe Asp Thr Asp 

1150 1155 

[ Asp Glu Leu Ser Tyr Arg Arg Asp Ser Val Tyr Ser Cys Val Thr 

1165 117Q 

Leu Pro Tyr Phe His Ser Phe Leu Tyr Met Lys Gly Gly Leu Met 

1180 1185 

I ttlo ^ T" K Val ^* «» Thr Phe 

1195 1200 

Leu Trp Phe Arg Ser Lys Gin Glu Ala Leu Lys Gin Gly Trp Leu 

1210 1215 

His Ly^ Lys Gly Gly Gly Ser Ser Thr Leu Ser Arg Arg Asn Trp 

J-^b 12 30 

Lys Lys Arg Trp Phe Val Leu Arg Gin Ser Lys Leu Met Tyr Phe 

1240 1245 

Glu Asn^ Asp ser Glu Glu Lys Leu Lys Gly Thr Val Glu Val Arg 

1255 1260 

I Thr Ala Lys Glu He He Asp Asn Thr Thr Lys Glu Asn Gly rie 

1270 12?5 

I Seo Ma HiS L6U Ile Ma 61 » 

1285 1290 

Pro Glu_ Asp Ala Ser Gin Trp Phe Ser Val Leu Ser Gin Val His 

1300 1305 

' Ma mo ^ 61U M6t His AS P G1 " «1« Ala 

1315 1320 

Asn Pr^ Gin Asn Ala Val Gly Thr Leu Asp Val Gly Leu lie Asp 

1330 1335 



Ser Val Cys Ala Ser Asp Ser Pro Asp ^ Prn Q _ phe 



Val 
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He lie Thr Ala Asn Arg Val Leu His Cys Asn Ma Asp Thr Pro 

1360 13 g 5 

| Glu Glu 0 Met His His Trp Xle Thr Leu Leu Gln ^ Ser Lys ^ 



1380 



| Asp J, Arg val Glu flly Qln ^ ^ ^ ^ ^ 

1390 1395 

[ His jj. Olu val Lys Asn ser Pro Lys Met Ser Ser Leu Lys Leu 

14 05 141Q 

I *** £. Arg T*p p he Val Leu Thr His Asn Ser Leu Asp Tyr Tyr 

1420 1425 

1 ^ ttlo ^ ^ ^ ^ ^ L - 

1435 1440 

I Sis L6U Ser Val £1 Asp Glu Lys He P he Lys 

1450 1455 
Glu Th^ Gly Tyr Trp ^ v al Thr Val ^ flly ^ Lys ^ 



1470 



Ss *** ^ ^ S 0 ^ 61u thr Ar g Trp Ser 

' ser ISo Ile Gln Asn Val 5f s ^ ^ Lys Ma Pro lle *•* Ti ~ 

1495 1500 
' Pro Th^ Gln Gln Leu Xle Gln Asp Ile Lys G l u Asn ^ Leu ^ 

1510 1515 

Ser As^ val Val Glu Gln lie Ty, Lys ^ Asn prQ Jle ^ 

1525 1530 

Tyr Th^ His His Pro Leu His Ser Pro Leu Leu pro Leu pro 

1540 1545 

Gly As^ He Asn Leu Asn Leu^ Leu Lys Asp Lys Gly Tyr Thr Thr 



1560 



L eu oin Asp Glu ^ ne ne phe ^ ^ ^ ^ 

1570 15?5 

Glu j. r Met Ser Asp Pro xie Pro He Xle Gln Gly xle Leu Gln 

1585 15go 

Thr Gly His Asp Leu Arg Pro Leu Arg Asp Glu Leu Tyr Cys Gln 

1600 1605 

Leu Sxo Lys Gln Thr Asn 2I 5 val Pro His Pro Gly ser Val «y 



1620 



ASP xS s ^ ^ ^ GlD SJ 0 LeU ^ *« ser cys Thr Pne 



1635 
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Le u £ s „ „ 01y ne tys ^ ^ l ^ ^ 

1645 1650 
m A, g Glu 01» Phe Pro o ly «, 81u - Glu Ly= Ma 

■ iOt>0 1665 

Leu Phe Thr Tyr Glu Ser Leu Lys Lys Thr Tvn * 

1670 y iys Tlur °ys Arg Glu Phe 

"""^ 1680 
V.! J» s« Ar g Asp 01tt Mu Ma l<u ne 

" U, *° 1695 
« Ssr * V.X ^ ^ His Gly 0Jy My s „ 

i/U1J 1710 
*r Ile s Asn Ser H is Thr ^ Ala Gly ^ ^ ^ ^ 

■ L/ ^ U 1725 
Xle Arg^ GIy Leu Ma Mefc ^ ^ ^ ^ ^ 

■ J " / " 1740 
Phe jta ^ _ Gly Mp ^ ua n= oitt ^ 

X/:>U 1755 
Val Val Q Ala Asp val Leu Lys pfae ^ ^ ^ 

1765 1770 

Ser Glu val Gly Asp Leu Pro Trp Lys Phe Tvr x 

1775 17ftn p . ^ Plie ^ Phe Lys Leu Tyr 

1780 1785 

Jta Leu Asp Thr Asp j_ „ sl Pro Lys ^ fc ^ ^ 

■ L/y!> 1800 
«. J* „st P te <n„ Mn „ „ ie „ u , vu n= m> 

OXU 1815 
His PrO Q Ma Pro Glu alu Asn Leu Qln yal ^ Ma Ma ^ 

1825 1830 
- -ta «r Ls„ 01 „ Gly Asp ^ t le „ His ^ „, pc<> 

°* U 1845 
»» £ Olu olu Val ^ s ^ G1 „ ^ Lea ^ Mj 

1855 1860 

Ser Gin Ser Thr Lys Thr Phe Thr p™ 

1865 ,„„ Thr Pro ^ G1 « Arg Leu Glu Lys 

/u 1875 

Arg Arg Thr Ser Phe Leu Glu Glv T 

1880 Gly Thr Leu Arg Arg Ser Phe Arg 

1885 1890 

S=t VU " 31 *» S° *■ ™ O. «. Glu Glu Met le „ " 

1905 

Asp Met Trp H e Lys Glu Glu Val s*>r » n , 

1910 VaA Ser Ser Ala Arg Ala Ser lie 

" 1920 
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He jjp Lys Trp Arg Lys jLu Cln uly Met Asn Gin Glu Gin Ala" 

1930 1935 

Met ji. Lys ^ Met Ma ne Lys Glu ^ ^ ^ 

1945 1950 

Ser Th^ Leu Phe Asp Val Glu Cys Lys Glu Gly Gly Phe Pro Gln 

1960 1965 

Glu Trp Leu Gly Val Ser Ala Asp Ala Val Ser Val Tyr Lys 

1975 1980 
Arg Gl^ Glu Gly Arg Pro Leu Glu Val Phe Gin Tyr Glu His lle 

1990 1995 

Leu B jr Pae Gly Ala Pro Leu Ala Asn Thr Tyr Lys Xle Val Val 

2005 2010 



Asp Glu s Arg Glu Leu Leu Phe Glu Thr Ser Glu Val Val Asp Val 

^020 2025 

Ala Ly^ Leu Met Lys Ala Tyr xle Ser Met Xle Val Lys Lys Arg 



2035 2040 



Tyr j« Thr Thr Arg Ser Ala Ser Ser Gin Gly Ser Ser Arg 



2050 2055 



Table 11 hMX2 nucleotide sequence (SEQ ID NO- 1 1) 
agctagtatc ttttattgtc agaacttctg tgagccaaca aacagttttg catggttgta 
cacaaaggga caaggcaaat ttcttttttc gtgtgggtag acttagttgg cccaagtcct 
taaaactttt ccatataaaa ataaaaagtc caagaccaga ttatttttct tctggtcata 
aatgctgatt tatttacagg tgccttgttc agaccaccat tataaacttg ggataaaata 
tgtgtgtatt aaagcctcag catttaatgt cagggtcctt tgaagattca ctcaagtgtt 
aagacgtttc tggaatgcag cgtctctccc ccatagtcaa catggttatt atatctgtaa 
tctatccaga atgatagaag ctaaccttcc aagtaacact ttgtttttaa cttaaatctt 
ttagacatga aagactccaa aatgacttca ttcttgttct aaaaccagca ctggagccag 
ctgttgaaga gtggtttata aatacagtta tcttgtaggc tgcttatctg tttataatac 
agcagacaca gatggcagac tttgctacat gtaaaacaat ggagtcaaca cgtgtttttc 
aaaatacagc aaagacagga aaatccagga tttgggtttg ttaataaaac caccttataa 
agtaacaatt gagactatag ctctgcatta ttaaaatata cagactgtgt acaccattac 
acatcctttt tccctttgct ttttaatgct catgaaacca tgattaaggt gttgagttta 
tgaacacatg cacgaacagg caagcacgta cacttaaaag atgaaacaaa gaaaaaagtt 
gattcatgtc attccatgag aaaggctgcc cgcagcactc cagctcaaac acactgtccc 
ctcgagctct ccatccccct tcccactccc tcaccttccc tcagattcgg ggaaatcagg 
ttgggaggtt agtgcatcat tgacagagaa tgcccccctt ccacgctctg ttaagtctcc 
cccagaaggg ggaaaggcag ttcccttcag tagcacagtt acgqtccatt ag t g ttggtt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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ccacaagtta 
aaggcatgtg 
cagcttaata 
agaccagaag 
actggagcca 
tcccgccttc 
ttcacgatca 
tcaaagagca 
aaagagagga 
acggagacgg 
cactggtggg 
tctaaaccat 
tgtgcatgca 
atttctaact 
tcttgctttg 
gaagtggcct 
aagccaagtc 
gttagtcttc 
gagcagttct 
cagggacttt 
ccccatatga 
taaattggag 
catcctccat 
ctctgtcgcc 
tgagttcaag 
accacgccca 
gatctcctga 
gagccaccgc 
aggaagccaa 
ctttgtacaa 
ccaatcaaaa 
gaattttata 
aaaatggaag 
aagattttta 
aacagccagg 
agaaaaaaag 
acaattagta 
tgggttgatg 



aggcacttcc 
tcctctaaga 
caggatcaat 
cttccagaaa 
gcctaggcca 
acctggagct 
tgctgatgta 
gctccctctc 
tgtgttcata 
cgtccgcgct 
caagagtcaa 
aagacaggct 
tgcatggagg 
agttaatctc 
actctccact 
ggaaacgtac 
tactcttgta 
aaatacaaaa 
ggctcctctt 
accagtttaa 
aggggctaac 
acgctgaaaa 
gcaaaagcca 
caggctggag 
cgattctcct 
gctaattttt 
cctcatgatc 
gcccggccaa 
gagtttaact 
agcacctgat 
acatcaccta 
cagcctcaat 
ctaatctaga 
gactttgtgg 
gaatatatct 
gaggcaggca 
agatttcttt 
ccaatggcta 



ggctgctttg 
gtagtaaagc 
gaaggcggca 
gcctgggcag 
gagggtggtg 
gccctggctg 
ggctttcatg 
atcgaccacg 
ctggaagact 
gacacccaac 
cagaagagtt 
ttgagtgaag 
ggcctgaaat 
acgcttttgg 
tctcagcatt 
atgcttggcc 
ttaatatctc 
tctaggatca 
cacaaggagc 
ctgtagactt 
aacacagggc 
gagtctctag 
gtagctttct 
tgcagtggcg 
gtctcagcct 
tgcagagatg 
cgcccgcctt 
gccagtagct 
gctattatct 
ttttttatac 
atgaaaagcc 
ataaagtcat 
cgatgagctg 
atcacatggt 
aaaggaatga 
agatctgacc 
tcaataagcg 
aattccatta 



gtggcagcgt 
tttggaaact 
ggcaaaagga 
ctctgtgttt 
cgttcaggta 
ctggcggagc 
agcttggcca 
atcttatacg 
tccagtggtc 
cagagttcct 
aagtcatgaa 
tcctctgtgc 
atgaaaaatg 
gaaaacctca 
attctactag 

gggggactta 

cagttctgcc 
cagaggaaaa 
agcaatggcc 
tcctgtacag 
tgatccaaac 
tttatatccc 
ttttttcttt 
tgatcctggc 
ctcgagtagc 
gggtttcacc 
ggcctcccaa 
ttctatgcta 
attccttgtc 
ttaaaaacag 
acccacggat 
cagatatacg 
gcacacttat 
ctctgtcaca 
gcttggctgt 
cacagaccag 
gtctacttaa 
cgagatagac 



ggttcctccc 
gtgcagactg 
tcctcggaga 
gttttggctg 
gcaaagacag 
gtgtcgtgct 
catccaccac 
tattcgccag 
ttccctctcc 
gagggaagcc 
gtggttggca 
agaagattaa 
gcacctctct 
ctaactggca 
ctgtttggat 
agaaagcttc 
tccaatcctc 
ttctccaaat 
ttccatatgc 
attggtggaa 
ctggacaagc 
taataaccag 
ttttttttga 
tcactgcaac 
tgggattaca 
gtgttagcca 
agcgctggga 
attcacagct 
agggagaaat 
gcataattga 
tctagaattt 
ctgaattact 
ctgttaaggg 
actactcaac 
gttccaataa 
tttgccaaac 
aacaaaacaa 
attcttcctt 



ctcctttttt 
ttaaagttga 
cacctccctc 
ggcatggcac 
gtgggctctg 
gtagcgcttc 
ctcactggtt 
gggtgcccca 
acgcttgtag 
accttccttg 
acagaaagca 
atatattcga 
ggctatcttg 
gagtctaaca 
tagctacgtg 
cctgcaaccc 
tttgcggatg 
cacgcatgct 
agagtgggaa 
gaaaataaga 
a 9gagggcta 
acattctctg 
cggagtctca 
ctccacctcc 
ggtgcatgcc 
ggatggtctc 
ttacaggcat 
cacgttttgc 
ggaattatgg 
accaaccaaa 
ataatattta 
gtgatcataa 
ctgcatagta 
tctgtacaaa 
aactttgttt 
tctcatctag 
aaatcagtac 
tcaaacagat 



1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2860 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
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ggctgtaaag 
cagctataac 
cttccctaag 
tacagatgca 
tacaattttg 
tctccacatg 
cacatggcag 
cttgatcaag 
ccacttgtca 
catctgctcc 
ggtcccctct 
I ggttgactgg 
gatggcagcg 
ctggaggttt 
aaacataaat 
aagtttgaag 
aaacttggct 
gttgtattca 
cttctccacc 
gccatggcaa 
tc gggaaggc 
gtatttttcc 
atacttgaga 
gctgtacagg 
1 gcagtacagc 
tattggaatt 
ctcatcctga 
cggaaggggc 
cttgtaaatc 
ctgctgggtg 
ccaccgggtg 
cacggtgacg 
I gcagaggctg 
gtaatccagg 
ctttggactg 
cactctggtg 
cgtgtcggcg 
atcagggctg 



aaaaaacaaa 
atttttt,taa 
gaagggtaga 
tttactacac 
ggggctcgaa 
aacttggccg 
ccagttctca 
gccatgtact 
atgatactgg 
tcctcgacct 
aggaagctcg 
ctgatgcggg 
tgcagagtat 
tcttccgggg 
gcaaactcca 
tagaatttcc 
aagacatcag 
aacaaagcaa 
acctccccag 
tagaccgtgg 
acaaactctc 
atctcggttc 
atccctcgac 
ttgcccacac 
tcgtcccgca 
gggtcagaca 

agggtggtat 

aggagcgggg 
tgttccacca 

ggggtgtcga 

gcctcgttga 
ttccagtagc 
ttgaggacca 
gaattgtggg 
ttcttcacct 
tcccctttgg 
ttgcagtgca 
tcagaggcac 



gtaaaatgca 
atgtagattt 
agaatagatg 
aggagaagag 
aaaaagaggt 
tgatccatgt 
cctccacatc 
tggccatggc 
ctcgagcaga 
tctgccggac 
tccgcctctt 
ccttgagtct 
aatccccctg 
ctggatggtg 
cactgtcttt 
atggcaggtc 
ctacgacggt 
acatgttcct 
cggtggtgtg 
atgtcatttc 
ggcatttggt 
ctggaaactg 
tcggcaggaa 
tgccggggtg 
gaggtcgcag 
tggactccag 
agcctttgtc 
agtgcaaggg 
catccgagtt 
tcggggcctt 
gcagcttggt 
ctgtctcttt 
gggtccccag 
tgagtacaaa 
ctttgtgcaa 
acctctgcag 
gcacccggtt 
acacagaatc 



agtatatcca 
ttatcagtgt 
acatcttaat 
aaactgtgag 
tgagagagca 
tctcagatgc 
aaacagcgtc 
ctgttcctgg 
ggagacttct 
cacggatcct 
ctccagccgt 
ctgcagggaa 
cagatactgg 
gccatggata 
tggcacgttg 
cccaacctcg 
tcgactttca 
gctgtcctcc 
ggagttgatg 
ctgcctgtgg 
tttcttaaga 
ttcccgtatc 
ggtgcagctc 
gggcactttg 
gtcatgccct 
ttgctgcagg 
tttcagcaag 
gtgatgggtg 
caggcagttc 
ggtgtcagtc 
gtagagccgg 
gaatatcttc 
tttgagcgcg 
ccaccgtttc 
ccatcctctc 
cagggttatc 
ggccgtgatg 
aatcagcccc 



aagtttctaa 
ttaaaaaatt 
tttgcattca 
gagaagggag 
aaatgctcca 
cagcacccag 
gagccatagc 
ttcattccct 
tccttaatcc 
gtccggaagc 
tcacaagggg 
taaacctctt 
agtcgcaggg 
accgcttcgt 
tctgtgtcca 
gatgtggcag 
atggctttgt 
atggccaggc 
gtgatcttgc 
atcagagctt 
gattcgtaag 
cttttcagat 
aggcatgtca 
ttggtctgtt 
gtctgtagga 
gaattgaata 
ttgagattta 
tatcgaagga 
tccttgatat 
acgttttgaa 
taacagtgct 
tcatctgggg 
ttcttctctg 
ttcagtttca 
acgatgaatt 
cagtggtgca 
atcacaaacg 
acatccaagg 



tttgtatata 
agatctatag 
ttcctaatat 
gcgttaatgg 
tcttgtcttc 
cccaccccaa 
caggccactc 
gaaatttcct 
acatgtccag 
tccgcctcag 
tgaaggtttt 
cgagaggtgg 
cagcaagaac 
gggcctgttc 
ggaagcagta 
ccagcttttc 
cgacgtggcc 
ctcggatcag 
aggagccgcc 
ctatttcatc 
tgaagagagc 
ggaacttgag 
ggatctgcca 
tgataagctg 
tgccctggat 
tcttgatggc 
tgtccccata 
tcgggttccg 
cttgaatcag 
tgacactgga 
tgcgcccgta 
ggacgacaga 
aactcttgta 
gtgaagacat 
cctggccctc 
tctcctccgg 
agttgggtct 
tgcccacagc 



3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 . 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
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attctgtggg 
gacctgactc 
gaaagtccta 
ctcttttgcc 
gtacatcagc 
cgtggaggag 
gcggaaccac 
caggccacct 
agagtcacgc 
actatcttca 
gtaggtcccc 
ggtcacgctg 
gtcgccgtct 
gctggaggag 
gtccgcactg 
gtcatcgctg 

gggggagtcc 

gtagggctgg 
ggaaaattcg 
gtcgaaattg 
gcggagctcc 
ctcctgctgc 
ttccacctgc 
gctcttctgc 
gagctcggct 
ttcttcctct 
gtaaactctc 
caggtgcaaa 
acaataaagg 
caccatggcc 
ttctcgaaga 
ctgcagcagg 
cactttatac 
gattctcaca 
aaactggtct 
ggagcttagc 
aggccgccga 
ttcaaaaaga 



tttgcctgct 
agcacgctga 
tcggccataa 
gttcgcactt 
ttggactggc 
cccccccctt 
aagaaggttt 
ttcatgtaca 
cggtatgaaa 
aacgaggact 
acagagcagc 
ctgccggaag 
ggggagggca 
ttgtgtaggc 
gggctggtgg 
gtccggatgc 
ttgaaggcgt 
ctgaagttga 
cttccccccg 
agggactcga 
tggtcccgcc 
tccttcatgc 
ttattttcct 
aaggcttcga 
tctcttcgct 
tcctgtttct 
cgagcaatct 
aatctcctcc 
acctttctgt 

gcgtggctca 

aataccttgg 
ctcgtgcact 
cttttgtaaa 
gtctccagca 
ggcatcttct 
gttgccatta 
tgtttgcttc 
tcgtagataa 



catcatgcat 
accactggct 
tgatgtcgat 
ctacggtgcc 
ggaggacaaa 
ttttgtggag 
catccttgag 
gaaagctgtg 
gctcatcatc 
gcgccccctc 
ggtagtcggg 
tgatggcacc 
agtccccagc 
tcccggagtc 
gcaccaccgt 
cacttgttcg 
cgtcgtcggc 
agttgggctt 
acagggaccg 
ggaactcctg 
gctcctgcag 
gctgcaggtc 
tctgtttctc 
gttcttgctg 
ctctctctct 
tcttttcttc 
gacctctgag 
tcagaaggaa 
attgtttccg 
cttcctcttc 
tcttccccag 
tccctcggac 
agtcctgaaa 
tccctgagta 
gcatgtttgg 
aggaatgcag 
cacatttcaa 
agtcaaaccg 



ctcctggatc 
ggcatcttct 
cccattctcc 
cttgagcttc 
ccagcgcttc 
ccagccttgc 
gacgcaccag 
gaaatacggc 
tgtatcaaac 
agagctgaac 
ggaccactgg 
gtcctcatag 
gttctggggc 
ctgcactgat 
gtcgttcatg 
ctggtctgag 
ttcgaagccc 
ctcctcgcat 
ctcgatattc 
ggccgccctg 
cttctgcagg 
ctcgatttct 
cagttcacgg 
cttcctcgtt 
ttctctttct 
ttgctccctt 
ttgcttctgg 
tgctctgtaa 
tgctaagaag 
cctccgcttc 
ctgccactcg 
gtcctcaggc 
gggtcttcgg 
ccgcagctgg 
cttgatacag 
tgagtcaacc 
ggtatcctgg 
gctttctctt 



tcctggtccg 

ggggactctg 

ttggtggtgt 
tcctcgctgt 
ttccaatttc 
ttgagggcct 
cggcgtttcc 
agagtgacac 
ctggaatcaa 
cggtaggcac 
ctgccgtagg 
tcatcctggt 
atgcagtagg 

ggggcgagca 

tatgggtcct 
tggccgtgct 
tcatcgacct 
gcgctctcag 
cggacacact 
cacgcttcct 
gaagcctcgg 
ttctccagac 
gtcagttcag 
tcttcttcct 
tcttcctccc 
ttctctgcca 
aaaactatgg 
ttcttctgta 
cccaagacat 
tccagtttct 
ctgttggagg 
agagccagat 
accgcatacc 
ttcagcacaa 
cgaacaaaga 
ttgaactgtg 
ttgttgcggc 
agcaaattga 



tggacgcgtg 
caatcaggtg 
tatctatgat 
cgttttcaaa 
tcctggacag 
cctgcttgga 
aagagttcat 
agctgtacac 
agtcctcttc 
ccgagctgtt 
agttggagaa 
cgtagtcgta 
tggactcgcc 
gcaccgtgct 
cctctgaaga 
cgctggggtt 
cctcctctgg 
ccagctcgct 
cgtcgatctc 
cctccagcct 
tcagcgacag 
ggaggatctc 
cttccttctg 

gctgggcgcg 

gtttcttctt 
gcaattgtct 
ctgccttttt 
ttatcaccac 
gggcccgaat 
gttccaagga 
catcatagag 
tcctcatcag 
cagctttgcg 
ccgcctggtc 
aaggattaga 
agctgactgt 
ttgaaacatg 
gaaggtcatc 



5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
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tcgaaatgta 
atagtgcttc 
cgcatgctga 
ttcttcattg 
attgtctatc 
taaagaaaaa 
gaactgttca 
agacttgaag 
gcagcacgca 
gagaggcgtg 
gagctgtgtt 
aacctgtgcc 
cagcctcgac 
aacttcccta 
caagtagtgg 
ccctgccagc 
tcggttctgg 
acagatgttc 
cgcattgccg 
tgtcttctcc 
gatcaattta 
gttgtcgtgg 
gggcagctcg 
ggcgatgggc 
atttctctta 
ggacgccatg 
ggtaattgtg 
gccttctgca 
ccgtgttccc 



tctctgttct 
actccaaaat 
ctgtgtagct 
ataagggcta 
cagtcaatat 
atatgcttgt 
aagtgattaa 
tcctcattgc 
tacagagcca 
aggatctctt 
gggtccagcc 
ccaccagcag 
acttcccgaa 
aaggattcct 
tagttttctg 
agtgcataaa 
gaagagagga 
agctgaacaa 
aaagcttcca 
tttaaggaca 
gtgctttcgg 
cgcttccaca 
cccaggtggc 
tggtaggggt 
taccgctgga 
tcatccacgc 
ctctgcttgt 
caggaattta 
tgtaaacaaa 



tctccaagat 
tgttaactgc 
tctccaataa 
ggaggccaag 
cttcccacac 
tgaagtactc 
cctcaaagtt 
ctttgatcct 
tggccaggga 
ctcccctgag 
caagtaactc 
tgataaattc 
cttcctcctt 
ggtcactgat 
gcgtagataa 
atatgtgata 
tacaatctac 
acttcccaaa 
tgatggggct 
attccaaaga 
ttttacctgc 
ggcagcggta 
gccggctgta 
tcacagaggc 
ataagttata 
cctcctcgtt 
aagtgaatac 
cagtacttgg 



acctcggaca 
aactctgggc 
ggtgctgtct 
tttcttctca 
taatccttcc 
ctgaagtttc 
ttcaaatcca 
gctgttgatc 
gtccctgctg 
gaacattgat 
cgcagatctg 
tatgttccca 
gctgaactgc 
tgtcttgtct 
ataaaattct 
attcctttcc 
aattctcccg 
gcgactagag 
gctttcaaga 
ctgttgactg 
cccactttca 
gcactcgttg 
ctgctccatg 
caggatggag 
catgatggag 
cgtggggtgc 
ctgaccatag 
aaaatgctgg 



tcatattgca 
ttcacataaa 
gtggcttgag 
atcaagtcca 
ctgctatatt 
tcgtttgcat 
aagatgtcga 
ttcttgatta 
tctactgcct 
ctctgggtca 
cccaaagctg 
agatgcagta 
atcacgtcca 
tctacacatc 
tctctttctt 
ccgggatttt 
ccctgaatat 
ttgttgttgt 
atagctcgtt 
atgactgaca 
cccttgatga 
gcgatggcga 
gtggcaggct 
ccgatgtagg 
ccgccatgga 
atagcagtca 
tctgtccgga 
ccattttctc 



cctctccagc 
agtggttatt 
gaaaatggct 
ggcattctcc 
ctagttgttc 
agtttatatt 
ggatgccaat 
cccactcaaa 
gttgaacatt 
aagcatctgt 
ttttgaagga 
taccagcaag 
ttgccgtaat 
cagactgatt 
catgttccag 
gccttactac 
ttcctttctg 
acacggtctt 
caacacagga 
gaaacttgag- 
ggatgcactg 
agatgtgcgg 
cgtacagccc 
tccatatttg 
gctctgtcaa 
ccttctggtg 
agacgacgat 
tcagccagac 



Table 12 KM X2 polypeptide sequence (SEQ ID NO:12) 

r^ne Cys Leu Gin Gly Tnr Arg Val Trp Leu Arg oiu Aan Gly Gin Hi s" 



15 



Phe Pro Ser Thr Val Asn Ser Cys Ala Glu Gly lie Val Val Phe Arg 
20 25 30 

Thr Asp Tyr Gly Gin Val Phe Thr Tyr Lys Gin Ser Thr lie Thr His 

5 40 45 

Gin Lys Val Thr Ala Met His Pro Thr Asn Glu Glu Gly Val Asp Asp 



7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 

9600 

9621 



60 
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Met Ala Ser Leu Thr Glu Leu His Gly G ly Ser 



70 



75 



He Met Tyr Asn Leu 



80 



Phe Gin Arg Tyr Lys Arg to Gln Ile Trp Tfar ^ ^ ^ ^ ^ 

90 95 

Leu Ala Ser Val Asn Pro Tyr Gin Prn ti« b i ™ 

100 ^ ?™ 116 Ala Leu Tyr Glu Pro 

105 110 
Ma Thr Met Glu Glo ^ s „ ^ Ms ^ ^ ^ 

125 

Hi. lie Phe «a ne Ala As. ol« <*s ^ ^ ^ ^ Itp Lys 

135 140 
His Asp te 61n ^ Le „ Ue ^ ^ ^ ^ M> ^ ^ 

155 160 
Thr 01u Ser ^ L Leu ne ^ ^ ^ ^ ^ ne s ^ 

170 175 
Cln Ser L eu Oiu Leu Ser Leu Lys oiu Lys Thr Ser ^ Val Glu 

185 190 
Ala 1U Leu G l„ sw Ser «. n Het G1 „ „ a phe Gly ^ 

205 

Thr val ^ Mn to ta ^ ^ phe ^ ^ vii . Gin 

ZXb 220 
Asn lie cys Gin Lys ^ Asn Ile ^ Qly Qiy ^ ^ ^ ^ ^ 

235 240 
He Leu Ser Ser Gin Asn Arg Val Val to , « 

245 9 ln Asn Pro G1 y Glu Arg 

250 



Asn Tyr His lie Phe Tyr Ala Leu Leu Ala 



255 



260 



265 • 



Gly Leu Glu His Glu Glu 



270 



«. «» Phe ^ ^ S5r ^ ^ 01 „ _ ^ ^ ^ 

280 285 
01. *j 01 y cys val <n„ j, Ly . ^ 1U s „ A<p ^ ^ 

^ 5 300 
j£ Clu Val Zle Thr Ma Met Asp Val Met Gin Phe Ser Lys Glu Glu 

315 320 

v.l *, Glu „ g «, Leu _ Ua 0ly ne ^ Ms ^ ^ ^ 



330 



He Glu Phe lie Thr Ala Gly Gly Ala Gin Val Ser Phe 



335 



345 



Lys Thr Ala 
350 



Gly Arg Ser Ala Glu Leu Leu Gly Leu Asp Pro Thr Gin Leu Thr 



360 



365 
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Leu 



ASP Thr Gin Arg Ser MeL Phe Leu Arg Gly Glu Glu He 

375 380 

Thr Pro Leu Asn Val Gin Gin Ala Val Asp Ser Arg Asp Ser Leu Ala 

395 400 
Met Ala Leu Tyr Ala Cys Cys Phe Glu Trp Val lie L ys Lys He Asn 
405 410 415 

Ser Arg lie Lys Gly Asn Glu Asp. Phe Lys Ser He Gly lie Leu Asp 

425 430 

He Phe Gly Phe Glu Asn Phe Glu Val Asn His Phe Glu Gin Phe Asn 
5 440 445 

lie Asn Tyr Ala Asn Glu Lys Leu Gin Glu Tyr Phe Asn Lys His lie 

455 460 

Phe ser Leu Glu Gin Leu Glu Tyr Ser Arg Glu Gly Leu Val Trp Glu 



475 



Asp He Asp Trp lie Asp Asn Gly Glu Cys Leu Asp Leu He Glu 



485 



490 



480 



Lys 



495 



Lys Leu Gly Leu Leu Ala Leu He Asn Glu Glu Ser His Phe Pro Gin 

505 510 
Ala Thr Asp Ser Thr Leu Leu Glu Lys Leu His Ser Gin His Ala Asn 



520 



525 



Asn His Phe iyr Val Lys Pro Arg Val Ala Val 



530 



535 



Asn Asn Phe Gly Val 



540 



Lys His Tyr Ala Gly Glu Val Gin Tyr Asp Val Arg Gly lie 



550 



555 



Leu Glu 
560 



Lys Asn Arg Asp Thr Phe Arg Asp Asp Leu Leu Asn Leu Leu Arg Glu 
565 570 575 

Ser Arg Phe Asp Phe lie Tyr Asp Leu Phe Glu His Val Ser Ser Arg 

585 590 
Asn Asn Gin Asp Thr Leu Lys Cys Gly Ser Lys His Arg Arg Pro Thr 



600 



605 



Val Ser Ser Gin Phe Lys Val Asp Ser Leu His Ser Leu Met Ala Thr 



620 

Leu Ser Ser Ser Asn Pro Phe Phe Val Arg Cys He 



630 



635 



Lys Pro Asn Met 
640 



Gin Lys Met Pro Asp Gin Phe Asp Gin Ala Val Val Leu Asn Gin Leu 



650 



655 



Arg Tyr Ser Gly Met Leu Glu Thr Val Arg He Arg Lys Ala Gly Tyr 



665 



670 



Ala Val Arg Arg Pro Phe Gin Tyr Lys Arg Tyr Lvs 



Val Leu 
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675 



680 



685 



Met Arg Asn Leu Ala Leu ^ ^ ^ ^ ^ ^ ^ ^ 

695 700 

Leu Leu Gin Leu Tyr Asp Ala Ser Asn Ser Glu Trp Gin Leu Gly Lys 

715 720 
Thr Lys Val Phe Leu Arg Glu Ser Leu Glu Gin Lys Leu Glu Lys Arg 

730 735 
Arg Glu Glu Glu Val Ser His Ala Ala Met Val He Arg Ala His Val 

745 750 

Leu Gly Phe Leu Ala Arg Lys Gin Tyr Arg Lys Val Leu Tyr Cys Val 

760 765 

Val lie He Gin Lys Asn Tyr Arg Ala Phe Leu Leu Arg Arg Arg Phe 



780 



Leu His Leu Lys Lys Ala Ala He val Phe Gin 



785 



790 



795 



Lys Gin Leu Arg Gly 



800 



Gin lie Ala Arg Arg Val Tyr Arg Gin Leu Leu Ala Glu Lys Arg Glu 

810 815 

Gin Glu Glu Lys Lys Lys Gin Glu Glu Glu Glu Lys Lys Lys Arg Glu 

825 830 

Glu Glu Glu Arg Glu Arg Glu Arg Glu Arg Arg Glu Ala Glu Leu Arg 

840 845 

Ala Gin Gin Glu Glu 'Glu Thr Arg Lys Gin Gin Glu Leu Glu Ala Leu 

855 860 
Gin Lys Ser Gin Lys Glu Ala Glu Leu Thr Arg Glu Leu Glu Lys Gin 

875 880 
Lys Glu Asn Lys Gin Val Glu Glu lie Leu Arg Leu Glu Lys Glu lie 

890 895 

Glu Asp Leu Gin Arg Met Lys Glu Gin Gin Glu Leu Ser Leu Thr Glu 

905 910 

Ala Ser Leu Gin Lys Leu Gin Glu Arg Arg Asp Gin Glu Leu Arg Arg 

920 925 

Leu Glu Glu Glu Ala Cys Arg Ala Ala Gin Glu Phe Leu Glu Ser Leu 

935 940 
Asn Phe Asp Glu lie Asp Glu Cys Val Arg Asn lie Glu Arg Ser Leu 
950 95 5 960 

Ser Gly Gly Ser Glu Phe Ser Ser Glu Leu Ala Glu Ser Ala Cys Glu 

965 970 975 

Glu Lys Pro Asn Phe Asn Phe Ser Gin Pro Tyr Pro Glu Glu Glu Val 



985 



990 
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Asp Glu Gly Phe Glu Ala Asp Asp Asp Ala Phe Lys Asp Ser Pro Asn 

" 5 1000 1005 

Pro Ser Q Glu His Gly His Ser_ Asp Gin Arg Thr Ser Gly lie Arg 

Thr ser Asp 'Asp Ser Ser Glu Glu Asp Pro Tyr Met Asn Asp Thr 

1030 1035 

Val mo Pr ° Thr Ser Pr ° Sis *** ASP Ser Thr ^5o L6U Leu Ma 

Pro Ser Val Gin Asp Ser Gly Ser Leu His Asn Ser Ser Ser Gly 
1055 1060 1065 

G1U So ^ Pr ° Gln ASn Ala G1 * AS P L ^ Ser 

1070 1 °75 1080 

Pro Asp_ Gly Asp Tyr Asp Tyr^ Asp Gin Asp Asp Tyr_ Glu Asp Gly 

Ala lie Thr Ser Gly Ser Ser Val Thr Phe Ser Asn Ser Tyr Gly 
1100 x 105 1110 

Ser Gin Trp Ser Pro Asp Tyr Arg Cys Ser Val Gly Thr Tyr Asn 

1120 1125 

Ser Ser Q Gly Ala Tyr Arg Phe^ Ser Ser Glu Gly Ala Gin Ser Ser 

Phe Glu Asp ser Glu Glu Asp Phe Asp Ser Arg Phe Asp Thr Asp 

1150 1155 

Asp Glu Leu Ser Tyr Arg Arg Asp Ser Val Tyr Ser Cys Val Thr 

1165 1170 

Leu Pro Tyr Phe His Ser Phe Leu Tyr Met Lys Gly Gly L eu Met 

1180 1185 

Asn Se^ Trp Lys Arg Arg Trp_ Cys Val Leu Lys Asp Glu Thr Phe 



1200 

Leu Trp Phe Arg Ser Lys Gin Glu Ala Leu Lys Gin Gly Trp Leu 

1210 1215 

His Lys Lys Gly Gly Gly Ser Ser Thr Leu Ser Arg Arg Asn Trp 



1215 

Lys Gly Gly Gly Ser Ser- Thr- t.o„ 
1220 ipoc 

1225 1230 

Arg Trp Phe Val Leu Ara Gin t„« 

1235 



Lys Lys Arg Trp Phe Val Leu Arg Gin Ser Lys Leu Met lyr Phe 

1240 1245 

Glu Asn Asp Ser Glu Glu Lys Leu Lys Gly Thr Val Glu Val Arg 

1255 1260 

^ ^65 ^ So ASD Thr Thr L * a Glu *sn Gly He 



1275 
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Asp Ile Ile Met Ala Asp Arg Thr Phe His Leu He Ala Glu Ser 
1280 1285 1290 

Pro Glu Asp Ala Ser Gin Trp Phe Ser Val Leu Ser Gin Val His 
1295 1300 1305 

Ala Ser Thr Asp Gin Glu Ile Gin Glu Met His Asp Glu Gin Ala 
1310 "IS 1320 

Asn Pro Gin Asn Ala Val Gly Thr Leu Asp Val Gly Leu Ile Asp 
1325 1330 1335 

Ser 073 Ala Ser ASP Ser Pro **9 Pro Asn Ser Phe Val 

1340 1345 1350 

Ile lie Thr Ala Asn Arg Val Leu His Cys Asn Ala Asp Thr Pro 
1355 1360 1365 

Glu Glu Met His His Trp Ile Thr Leu Leu Gin Arg Ser Lys Gly 
1370 1375 13 ^ 0 y V 

Asp Thr Arg Val Glu Gly Gin Glu Phe He Val Arg Gly Trp Leu 
1385 1390 13gs 

His Lys Glu Val Lys Asn Ser Pro Lys Met Ser Ser Leu Lys Leu 
1400 1405 j.410 • 

Lys Lys Arg Trp Phe Val Leu Thr His Asn Ser Leu Asp Tyr Tyr 
1415 1420 1425 

Lys Ser Ser Glu Lys Asn Ala Leu Lys Leu Gly Thr Leu Val Leu 
1430 i4 3 5 1440 

Asn Ser Leu Cys Ser Val Val Pro Pro Asp Glu Lys He Phe Lys 
1445 1450 1455 

Glu Thr Gly Tyr Trp Asn Val Thr Val Tyr Gly Arg Lys His Cys 
1460 1465 14?0 

Tyr Arg Leu Tyr' Thr Lys Leu Leu Asn Glu Ala Thr Arg Trp Ser 
1475 i48o 1485 

Ser Val lie Gin Asn Val Thr Asp Thr Lys Ala Pro Ile Asp Thr 
1490 1495 isoo 

Pr ° ^ Gln GlD Leu Ile Gln Ile Glu Asn cys Leu Asn 

1505 1510 1515 

Ser Asp Val Val Glu Gln Ile Tyr Lys Arg Asn Pro lie Leu Arg 
1520 1525 1530 9 

Tyr Thr His His Pro Leu His Ser Pro Leu Leu Pro Leu Pro Tyr 
1535 1540 1545 

Gly Asp Ile Asn Leu Asn Leu Leu Lys Asp Lys Gly Tyr Thr Thr 
1550 1555 1560 

Leu Gln Asp Glu Ala Ile Lys He Phe Asn Ser Leu Gln Gln Leu 
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48 



1565 



1570 



1575 



Glu ser Met Ser Asp Pro He Pro He lie Gin Gly n e Leu Gin 
1580 "85 1590 

Thr Gly His Asp Leu Arg Pro Leu Arg Asp Glu Leu Tyr Cys Gin 
1595 1600 160 5 

Leu lie Lys Gin Thr Asn Lys Val Pro His Pro Gly Ser Val Gly 
1610 1620 

Asn Leu Tyr Ser Trp Gin lie Leu Thr Cys Leu Ser Cys Thr Phe 
1525 1630 1635 

Leu Pr^ Ser Arg Gly lie Leu_ Lys Tyr Leu Lys Phe^ His Leu Lys 

j Arg lie Arg Glu Gin Phe Pro Gly Thr Glu Met Glu Lys Tyr Ala 

I 660 1665 
Leu Phe Q Thr Tyr Glu Ser Leu_ Lys Lys Thr Lys Cys^ Arg Glu Phe 

' 168*5 ^ ^ G1U Ma L6U 116 His Ar 9 Gin Glu 

Met Thr Q Ser Thr Val Tyr Cys^ His Gly Gly Gly jar Cys Lys He 
| Thr Ile s Asn Ser His Thr Thr^ Ala Gly Glu Val Val Glu Lys Leu 

He Arg Gly Leu Ala Met Glu Asp Ser Arg Asn Met Phe Ala Leu 

1735 1740 

Phe Glu Tyr Asn Gly His Val Asp Lys Ala lie Glu Ser Arg Thr 
1745 "50 1755 

V ^ 1*60 Wa J£ 5 LyS ^ G1U LyS ^ U Q M * Thr 

Ser Glu_ val Gly Asp Leu Pro^ Trp Lys Phe Tyr Phe Lys Leu Tyr 

1790 ASP Val Pr ° LyS AS P Ser Glu Phe 

1790 I 7 * 5 1800 

Ala Phe Met Phe Glu Gin Ala His Glu Ala Val lie His Gly His 
1805 181° 1815 

So ° 1U ° 1U ASD L6U ° ln Val Leu ^ *** Arg 

1825 i83o 

Leu Gin Tyr Leu Gin Gly Asp Tyr Thr Leu His Ala Ala lie Pro 
1835 1840 1845 

Pro Leu^ Glu Glu Val Tyr Ser Leu Gin Arg Leu Lys Ala Arg lie 
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Ser Gin Ser Thr Lys Thr Phe THr Pro Cys Glu Arg Leu Glu Lys 
1865 1870 1875 

Arg Arg Thr Ser Phe Leu Glu Gly Thr Leu Arg Arg Ser Phe Arg 

1885 1890 

Thr Gly Ser Val Val Arg Gin Lys Val 'Glu Glu Glu Gin Met Leu 
1895 "00 1905 

Asp Met Trp lie Lys Glu Glu Val Ser Ser Ala Arg Ala Ser lie 
1910 19 15 1920 

116 LyS ^ *** L * S Phe Gln Gly Met Asn Gin Glu Gin Ala 

1925 "30 a9 35 

M6t £?„ ^ ^ L6U 116 LyS Glu Tr P Pro Tyr Gly 

1940 • 1945 1950 *■ X 

Ser Thr Leu Phe Asp Val Glu Val Arg Thr Gly Cys His Val Leu 
55 I 9 " 1965 

?S n TrP His Ldu **** Thr Trp lie Thr Ala Lys 

1970 !975 1980 

Phe Met Trp Arg Glu Asp Lys Met Glu His Phe Ala Leu Ser Thr 
1985 1990 1935 

2S00 *** Ma Pr ° LyS 116 Val Pro Leu Thr *™ *™ Pbe 
2000 2005 2010 



Ser Ser Gin Phe Leu Phe Ser Cys Val Val Asn Ala Ser Val He 
2 °15 2020 2025 

^ SJo ™~ ** 9 075 HiS L6U Phe Phe P ™ 



2035 2040 



Ser Leu Gly Lys Leu 
2045 



Table 13 hMP nucleotide sequence (SEQ ID NO:13) 



ccaacttttg cagctccacc caggatgtgg cctcgctcca ccccagctgt gcgcctctct 
ccacccctag gcgaaggcac tagaatttcc caaattaaga acgaagagga agtttggacc 
ttttcggcca ccgctcgctt caatatggct gcccccaggg agagacgagg ctaccatgaa 180 
ggagccgagc gcagaccctg agtccgtcac ccatggatcg cagcgcggag ttcaggaaat 240 
ggaaggcgca atgtttgagc aaagcggacc tcagccggaa gggcagtgtt gacgaggatg 
tggtagagct tgtgcagttt ctgaacatgc gagatcagtt tttcaccacc agctccttcg 
ctggccgcat cctactcctt gaccggggta taaatggttt tgaggttcag aaacaaaact 420 
gttgctggct actggttaca cacaaacttt gtgtaaaaga tgatgtgatt gtagctctga 480 
agaaagcaaa tggtgatgcc actttgaaat ttgaaccatt tgttcttcat gtgcagtgtc 
gacaattgca ggatgcacag attctgcatt ccatggcaat agattctggt ttcaggaact 



60 
120 



300 
360 



540 
600 



WO 01/70808 



-50- 



PCT7US01/09609 



ctggcataac 

tagaagttcc 

tgttaaatgt 

actgcctaca 

agaaaaataa 

agtgtattac 

tcaatgttac 

atgtttctag 

atcagccatt 

cagaggatgt 

cctgaaatgc 

taaatcaaag 



ggtgggaaag 

attaagccat 

ggcaaatcaa 

gcatgctttg 

ctcatcatat 

taaagaaagt 

catcttccct 

taggtttta.t 

cataagccag 

gcagttgtca 

tatagttatc 

atgtttgaaa 



agaggaaaaa 

aagggaaaac 

aaaatggagg 

gaaagggaaa 

attcataaga 

gatgaagaac 

gaagattact 

aaagctgctc 

taatgacaag 

tctaagctct 

cctacttttt 

aaaaaaaa 



ctatgttggc 

tgatggtgac 

aaaacaagaa 

cgatgactaa 

aaaaaagaaa 

ttgaaaatga 

aagctttggt 

ttcataagag 

tgcagagctt 

cagcagtacc 

taccagtttc 



tgtccggagt 

agaggaatat 

aagaattgag 

cttacatccc 

cccagaaaaa 

tgatgatgat 

tctgatgtgt 

tattttagtt 

caaactataa 

cggcttatcc 

tcccagaagc 



acacatggct 

attgacttcc 

aggttttaca 

aagatcaaag 

acacgtgccc 

gatctaggaa 

cttggccgta 

tgttgagtgt 

ctttgttgcc 

tacgacttca 

acctgcttaa 



Table 14 hMP polypeptide sequence (SEQ ID NO: 14) 



Met Asp Arg Ser Ala Glu Phe Arg Lys T rp Lys Ala Gin Cys Leu Ser 



10 



15 



Lys Ala Asp Leu Ser Arg Lys Gly Ser Val Asp Glu Asp Val Val Glu 



25 



30 



Leu Val Gin Phe Leu Asn Met Arg Asp Gin Phe Phe Thr Thr Ser Ser 
35 40 



45 



Phe Ala Gly Arg lie Leu Leu Leu Asp Arg Gly He Asn Gly Phe Glu 
50 55 60 

Val Gin Lys Gin Asn Cys Cys Trp Leu Leu Val Thr His Lys Leu Cys 
70 75 80 

Val Lys Asp Asp Val lie Val Ala Leu Lys Lys Ala Asn Gly Asp Ala 
85 ^O 95 

Thr Leu Lys Phe Glu Pro Phe Val Leu His Val Gin Cys Arg Gin Leu 
100 105 110 

Gin Asp Ala Gin He Leu His Ser Met' Ala lie Asp Ser Gly Phe Arg 
115 120 125 

Asn Ser Gly lie Thr Val Gly Lys Arg Gly Lys Thr Met Leu Ala Val 
±JSU 135 140 

Arg Ser Thr His Gly Leu Glu Val Pro Leu Ser His Lys Gly Lys Leu 
150 155 160 

Met Val Thr Glu Glu Tyr lie Asp Phe Leu Leu Asn Val Ala Asn Gin 

165 170 175 

Lys Met Glu Glu Asn Lys Lys Arg lie Glu Arg Phe Tyr Asn Cys Leu 



185 



190 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1288 
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51 



Gln His Ala Leu Glu Arg Glu Tar Met Thr Asn Leu Hi 



195 



s Pro Lys lie 



200 205 
Lys Glu Lys Asn Asn Ser Ser Tyr lie His Lys Lys Lys Arg Asn 



215 220 
Glu Lys Thr Arg Ala Gin Cys lie Thr Lys Glu Ser Asp Glu <51u 



230 



235 



Pro 



Leu 
240 



Glu Asn Asp Asp Asp Asp Asp Leu Gly He Asn Val Thr He 
245 250 

Glu Asp Tyr 



Phe Pro 
255 



acgcgtgcag 
tcttccgagt 
actccctcaa 

taagggcaac 

gaaatgagaa 
agatcagcaa 
aaggtgcaag 
a gtaa tccca 
caaaagttcc 
acagagattc 
aacctggaaa 
ctcattttaa 
ttgaagaaca 
ggactccagt 
gctcgcaagg 
tcaagcgctc 
ataatgagca 
tgtctggggg 

cggggaattc 

cagtctccaa 
atgaaccaca 
atcaacatgt 
aaaggaagag 
ggaatgcgaa 
ttcctgtatt 
ttagtcacga 
catgtgctat 



Table 15 NHR nucleotide sequence (SEQ ED NO: 1 5) 



gtggcgtggc 

atcgccggga 
gtacagtgac 
caagttgtta 
tcaggatgaa 
ccaggaagag 
actgtccgtg 
ctgaattcca 
ttctccacca 
aaaggtacct 
aaataaaaga 
ggaaatggag 
caattccatg 
acctccaaga 
ccggtcttgt 
tgctatctct 
taagcgttca 
cacccaaaaa 
tgctgctgtt 
taagaaacca 
caaaggaaag 
caacaaatta 
caacggaaga 
ggggcctcat 
ctcaactttt 
gatctttttc 
gatctctgaa 



gccagggatt tgaaccgcgc tgacgaagtt tggtgatcca 
tttcgaatcg cgatgatcat cccctctcta gaggagctgg 
ctgcagaact tagccaagag tctgggtctc cgggccaacc 
aaagccttga aaggctacat taaacatgag gcaagaaaag 
agtcaaactt ctgcatcctc ttgtgatgag actgagatac 
ctgagagaca gccacttggc catgtcacca aaacaaggag 
tggaccctga ctcacagaga atcattcaga gataaaaata 
gaatcatgaa aagcaggaaa gccaggatct cagagcactg 
gacgagcacc aagaagctga gaatgctgtt tcctcaggta 
tcagaaggaa agaaatctct ctacacagat gagtcatcca 
actgcaatca ctactccaaa ctttaagaag cttcatgaag 
tccattgatc caatatatng aggagaaaaa aagaaacatt 
aatgaactga agcagccgcc catcaataag ggaggggtca 
ggaagactct ctgtggcttc tactcccatc agccaacgac 
ggccctgcaa gtcagagtac cttgggtctg aaggggtcac 
gcagctaaaa cgggtgtcag gttttcagct gctactaaag 
ctgaccaaga ctccagccag aaagtctgca catgtgaccg 
ggcgaggctg tgcttgggac acacaaatta aagaccatca 
attaccccat tcaagttgac aactgaggca acgcagactc 
•gtgtttgatc ttaaagcaag tttgtctcgt cccctcaact 
ctaaaaccat gggggcaatc taaagaaaat aattatctaa 
acttctacaa gaaaacttac aaacaacccc atctccagac 
aacgcgagca agaagaaagg agaagaaagc aaaggttttg 
tttggctgaa gattaataat tttttaacat cttgtaaata 
ttccttttgt aaattttttt tttttgctgt catccccact 
tgctaactgt tcatagtctg tgtagtgtcc atgggttctt 
aagacgttat caccttaaag ctcaaattct ttgggatggt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
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ttttacttaa 


gtccattaac aattcaggtt 


tctaacgaga cccatcctaa aattctcttt 


1680 


ctagtttttt 


aatgtcacca 


tcccaaactc 


ccgtttctgg atttttaatc cccagctccc 


1740 


cagttccctc 


ttatcgtact 


aatattaaca 


gaactgcagt cttctgctag ccaatagcat 


1800 


ttacctgatg gcagctagtt 


atgcaagctt 


caggagaatt tgaacaataa caagaatagg 


1860 


gtaagctggg atagaaaggc 


cacctcttca 


ctctctatag aatatagtaa cctttatgaa 


1920 


ac ergggccat 


atagtttggt 


tatgacatca 


atattttacc taggtgaaat tgtttaggct 


1980 


tatgtacctt 


cgttcaaata 


tcctcatgta attgccatct gtcactcact atattcacaa 


2040 


aaataaaact 


ctacaactca 


ttctaacatt 


gcttacttaa aagctacata gccctatcga 


2100 


aatgcgagga 


ttaatgcttt 


aatgctttta 


gagacagggt ctcactgtgt tgcccaggct 


2160 


ggtctcaaac 


tccaccaaat 


gtacttctta 


ttcattttat ggaaaagact aggctttgct 


2220 


tagtatcatg 


tccatgtttc 


cttcacctca 


gtggagcttc tgagttttat actgctcaag 


2280 


atcgtcataa 


ataaaatttt 


ttctcattgt 


caaaaaaaaa aaaaaaaaaa aaaaaaaaaa 


2340 


aaaaaaaaaa 


aa 






2352 



Table 16 NHR polypeptide sequence (SEQ ID NO: 1 6) 



Met lie lie Pro Ser Leu Glu Glu Leu Asp Ser Leu Lys Tyr Ser Asp" 
1 5 10 15 

Leu Gin Asn Leu Ala Lys Ser Leu Gly Leu Arg Ala Asn Leu Arg Ala 
20 25 30 

Thr Lys Leu Leu Lys Ala Leu Lys Gly Tyr He Lys His Glu Ala Arg 
35 40 45 s 

Lys Gly Asn Glu Asn Gin Asp Glu Ser Gin Thr Ser Ala Ser Ser Cvs 
50 55 J° 



Asp Glu Thr Glu lie Gin lie Ser Asn Gin Glu Glu Ala Glu Arg Gin 

70 75 80 

Pro Leu Gly His Val Thr Lys Thr Arg Arg Arg Cys Lys Thr Val Arg' 
85 90 95 

Val Asp Pro Asp Ser Gin Gin Asn His Ser Glu He Lys He Ser Asn 
100 105 110 

Pro Thr Glu Phe Gin Asn His Glu Lys Gin Glu Ser Gin Asp Leu Arg 
115 120 M 



125 



Ala Thr Ala Lys Val Pro Ser Pro Pro Asp Glu His Gin Glu Ala Glu 

135 i4 0 

Asn Ala Val Ser Ser Gly Asn Arg Asp Ser Lys Val Pro Ser Glu Gly 



155 



160 



Lys Lys Ser Leu Tyr Thr Asp Glu Ser Ser Lys Pro Gly Lys Asn Lys 
165 170 175 

Arg Thr Ala lie Thr Thr Pro Asn Phe Lys Lys Leu His Glu Ala His 
i 8 - 0 . 185 lgo 
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Phe Lys Glu Met Glu Ser He Asp Pro lie Tyr Xaa Gly Glu Lys Lys 
195 200 205 

Lys His Phe Glu Glu His Asn Ser Met Asn Glu Leu Lys Gin Pro Pro 
210 . 215 220 

lie Asn Lys Gly Gly Val Arc, Thr Pro Val Pro Pro Arg Gly Arg Leu 
225 230 235 9 240 

Ser Val Ala Ser Thr Pro lie Ser Gin Arg Arg Ser Gin Gly Arg Ser 
245 250 255 

Cys Gly Pro Ala Ser Gin Ser Thr Leu Gly Leu Lys Gly Ser Leu Lys 
260 265 270 

Arg Ser Ala lie Ser Ala Ala Lys Thr Gly Val Arg Phe Ser Ala Ala 
275 280 285 



Thr Lys Asp Asn Glu His Lys Arg Ser Leu Thr Lys Thr Pro Ala Arg 
290 295 300 a 

Lys Ser Ala His Val Thr Val Ser Gly Gly Thr Gin Lys Gly Glu Ala 



310 315 



320 



Val Leu Gly Thr His Lys Leu Lys Thr lie Thr Gly Asn Ser Ala Ala 
325 330 335 

Val He Thr Pro Phe Lys Leu Thr Thr Glu Ala Thr Gin Thr Pro Val 
340 345 35Q 

Ser Asn Lys Lys Pro Val Phe Asp Leu Lys Ala Ser Leu Ser Arg Pro 
355 3 6 o 3g5 

Leu Asn Tyr Glu Pro His Lys Gly Lys Leu Lys Pro Trp Gly Gin Ser 

375 380 



Lys Glu Asn Asn Tyr Leu Asn Gin His Val Asn Arg He Asn Phe Tyr 
390 395 4 oo 

Lys Lys Thr Tyr Lys Gin Pro His Leu Gin Thr Lys Glu Glu Gin Arq 
405 410 a 



415 



Lys Lys Arg Glu Gin Glu Arg Lys Glu Lys Lys Ala Lys Val Leu Gly 
420 425 430 

Met Arg Arg Gly Leu lie Leu Ala Glu Asp 
435 440 



Table 1 7 displays alignment of hMXl, bMX2 with human myosin (SEQ ID 
NO:31;GenBankAF247457)(Ber ge ^/.,2000). As seen fiom the alignment, hMXl and 
hMX2 have a likely N-terminus of M N D residues. One of skill in the art can easily 
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verify this observation by probing cDNA or genomic human libraries, or PGR techniques, 
to acquire the foil length polynucleotide sequence. 

Table 17 Alignment of hMXl, hMX2 and human myosin X 



10 
12 

humX 



12 

humX 



- - -FCLQQTRVWLRENGQHFPSTVNSCAEGIWFRTDYGQVFTYKQSTIT 

1 mdnfftegtrvwlrengqhfpstvnscaegivvfrtdygqvftykqstS 

10 48 HQKVTAMHPTNEEGVDDMASLTELHGGSIMYNLFQRYKRNQIWTYIGSIT, 

humX 51 hq ^^™eegvddmasltel^^ 

10 98 ASVNPYQPlAGLYEPATMEQYSRRHLGELPPHIFAIANECYRCLWKRHnN 

12 98 ASVNPYQPIAGLYBPATMEQYSRRHLGELPPHIPAIA^SSS^m 

huncc ioi asvnpyqpxaglyepatmeqys^elppSSSS™ 

huntt 15 1 QCILISGESC^GK^STKLIX.KFI.SVIsSsLELsSS^S 

10 198 SPIMEAFGNAKTVYNNNSSRFGKFVQLNICQKGNIQGGRIVDCILSSONR 

SPIME!APGNAKTVYraWSSRPGKI^OI. W Tnn^MT^^™"„:?^ 



198 

201 

248 yVRQNPGERNYHIFYALLAGIjEHEEREEFYIiSTPENYHYIiNOSGCVEDKT 

248 vvrqnpgeri^ifyai^gleheereefylstpenyhSqSSSS 
250 vvrqnpgernyhifyali^lf^ereefylstpeSSSgSS 

III ^°^f^ IT ^ WQFSKEE ^ S ^IAGILHLGNIEFITAGGAOV 

300 isdqesfrevitamdvmqfskeevrevsrli^ 

c^^ RSMLLGLDPTQLTDALTQRSMFLRGEEI ltplnvqqavdsrd 
3 so 8 ==SZ^^Z— 

400 sx^vaccfewvikkinsrikgnedfksigildxfgfeiSSIS 

\l ttl ^^^^^"S^Q^SREGLVWEDIDWIDNGECLDLIEKK 

humX 450 NINYANEKLQEYFNKHIFSLEQLEYSREGLVWEDIDWIDNGECTBLIE^ 

til ^^r^ ESOTPQAroSmLEmHSQHA ^™RVAVNNFGV^ 
humX J™ ^^I^ESHFPQATDSTIiEKLHSQHANNHFYVKPRVAVNNFG^Y 

humX 500 ^^^shfpqatosiu.eklhsqha^yvkprvaSSJ 

12 548 AGEVO™rJ LE ^ T ^ D ™^ S ^ FITOLFEWSSR ^Q^ 



10 248 
12 

humX 



10 
12 

humX 



10 348 
12 348 
humX 



10 398 
12 

humX 



10 
12 



12 548 AGEVQYDVRGILEKIIRDTFRDDLLbnjl^ESRFDFIYDLFEHVS^ 

humX 550 AGEVQYDVRGILEKNRDTFR^^^ 
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10 
12 

humX 


648 
648 
649 


10 
12 

humX 


698 
698 
699 


10 
12 

humX 


748 
748 
749 


10 
12 

humX 


798 
798 
799 


10 
12 

humX 


848 
848 
849 


10 
12 

humX 


898 
898 
899 


10 
12 

humX 


948 
948 
949 ] 


10 
12 

humX 


998 'i 

998 2 

999 i 



DQAVVLNQLRYSGM^TVRIRKAGYAVRRPFQDFYKRYK^ 
DQAVVLNQLRYSGMLETVRIRKAGYAVRRPFQDFYKRYICVLMIUJLALPBD 
DQAVVLNQLRYSGMLETVRIRKAGYAVRRPFQDFYKRYKVLMRNLALPED 

VRGKCTSLLQLYDASNSEWQLGKTKVFLRESLEQKLEKRREEEVSHAAMV 
VRGKCTSLLQLYDASITSEWQLGKTKVFLRESLEQKLEKRREEEVSHAAMV 
VRGKCTSLLQLYDASNSEWQLGKTKVFLRESLEQKLEKRREEEVSHAAMV 

IRAHVLGFLARKQYRKVLYCVVI IQKNYRAFLLRRRFLHIiKXAAIVFQKO 
IRAHVLGFLARKQYRKVLYCWIIQKNYRAFLLRRRFLHLKKAAXVFOKO 
IRAHVLGFIiARKQYRKVLYCWl IQKNYRAFLLRRRFLHXKKAAIVFQKQ 

LRGQIARRVYRQLLAEKREQEEKKKQEEEEKKKREEEERERERERREAEL 
LRGQIARRVYRQLLAEKREQEEKKKQEEEEKKKREEEERERERERREAEL 
LRGQIARRVYRQLLAEKREQEEKKKQEEEEKKKREEEERERERERREAEL 

RAQQEEETRKQQELEALQKSQKEAELTRELEKQKENKQVEEILRLEKEIE 
RAQQEEETRKQQELEAIiQKSQKEAELTREIiEKQKENKQVEEILRIiEKEIE 
RAQQEEETRKQQELEALQKSQKEAELTRELEKQKEHKQVEEILRLEKEIE 

DLQRMKEQQELSLTEASLQKLQERRDQELRRLEEEACRAAQEFLESLNFD 

DLQRMKEQQELSLTEASLQKLQERRDQELRRLEEEACRAAQEFLESLNFD 

EIDECVRNIERSLSGGSEFSSELAESACEEKPNFNFSQPYPEEEVDEGFE 
EIDECVRNIERSLSGGSEFSSELAESACEEKPNFNFSQPYPEEEVDEGFE 
EIDECVRNIERSLSVGSEFSSEIAESACEEKPNFNFSQPYPEEEVDEGFE 

998 ADDDAFKDSPNPSEHGHSDQRTSGIRTSDDSSEEDPYMNDTWPTSPSAD 



10 
12 

humX 1049 



1048 
1048 



STVLLAPSVQDSGSLHNSSSGESTYCMPQNAGDLPSPDGDYDYDQDDYED 
f^f ,SVQDSGSLHNSSSGEST ^CMPQNAGDLPSPDGDYDYDQDDYED 
STVLLAPSVQDSGSLHNSSSGESTYCMPQNAGDLPSPDGDYDYDQDDYED 



10 
12 



1098 GAITSGSSVTFSNSYGSQWSPDYRCSVGTYNSSGAYRFSSEGAQSSFEDS 
humX 1099 ^f°^~^!????? D ^ CSVG ™ SGA ^SSEGAQSSFEDS 



1098 



1148 
1148 



GAITSGSSVTFSNSYGSQWSPDYRCSVGTYNSSGAYRFSSEGAQSSFEDS 

^^^ RroroDELS ^ SWSC ^PYFHSFLYMKGGLMNSHKRRWCV 
huntt i^q ^^!^ TODELSY ^SVYSCVTLPYFHSFLYMKGGLMNSWKRRWCV 
hun* 1149 EEDFDSRFDTDDELSYRRDSVYSCVTLPYFHSFLYMKGGLMNSWKRRWCV 

lilt T L ^!™ PRSKQ ^ K ^^^ GGGSS ^SI^KKRWFVLRQSKLm 
humX ^^^^^QGWLHKKGGGSSTLSRWWKKRWFVLRQSKLMY 
humX 1199 LKDETFLWFRSKQEALKQGWIJUCKCSGGSSTLSRRNWKKRWFVLRQSKLMY 



10 
12 



10 
12 



1248 ™ff^°!^^ IIDimKENGIDII MADRTFHLIAESPEDA 
humX 1249 



1248 FENDSEEKLKGTVEVRTAKEI IDNTTKENGIDIIMADRTF^IA^SPE^A 
FENDSEEKLKGTVEVRTAKEIIDNTTKENGIDIIMADRTFHLIAESPEDA 



10 
12 



1298 



SQWFSVLSQVHASTDQEIQEMHDEQAHPQNAVGTLDVGLIDSVCAS; 



DSPD 



1298 SQWFSVLSQ\raASTDQEIQEMHDEQANP^I»VG^VGLIDSVCASDSPD 
hUmX 1299 S Q WPS ^SQVHASTDQEIQEMHDEQA^ 



10 



1348 



GW 
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^^^^^^^^^^^ 

MKEVKNSPKMSSLKLKKRWFVLTHNSI^yYKSSEK^^S^ 

2£S EraFraTCyWN ^GR^CYRLYTKLI^TRWSSVIQN™ 
SVVPPDEKIFKETGYWNVTVyGI^CYRLYTKI^TRWslSQS 

^5^!? QLIQDIKENCLNSD VVEQIY KR Npi LRYTHH p LHSpLLpijP 
KAPlDTPTQQLIQDIKENCnjNSDVVEQIYKKHPILRyTHHPMSPLr pr d 



12 

humX 


1348 
1349 


10 
12 

humX 


1398 
1398 
1399 


10 
12 

humX 


1448 
1448 
1449 


10 
12 

humx 


1498 
1498 
1499 


10 
12 

humX 


1548 
1548 
1549 


10 
12 

humx 


1598 

1598 ] 

1599 ] 


10 
12 

humx 


1648 ] 

1648 I 

1649 I 




PLPSRGILKY 



LKFiaKRIREQFPGTEMEKYALFTYESLKKTKC^PA^S^EIEAIiI^Q 

- s ssassssHssssa™ 

s. = ===== 

11 till ^^^SLQRLKARISQSTKTFTPCERLEKRRTSFLEGTLRRSFRTGSV 

PPLEB WSLQRLKARISQSTKTPTPCERLEKRRTSFLEGTLRRSFRTCSV 

U 1948 2£S!E5^^ 

1999 ^SFGAPLAOTYKIVVDERE^ 

in nrnri 



10 2048 RSASSQGSSR 

12 2047 KL 

humX 2049 RSASSQGSSR 
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The invention also includes polypeptides and nucleotides having 80-100%, 
including 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 and 99%, 
sequence identity to SEQ ID NOS:l-16, as well as nucleotides encoding any of these 
polypeptides, and compliments of any of these nucleotides. In an alternative 
embodiment, polypeptides and/or nucleotides (and compliments thereof) identical to any 
one of, or more than one of, SEQ ED NOS:l-16 are excluded. In yet another embodiment, 
polypeptides and/or nucleotides (and compliments thereof) having 81-100% identical, 
including 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 and 99%, 
sequence identity to SEQ ID NOS:l-16 are excluded. 

The nucleic acids and proteins of the invention are potentially useful in promoting 
wound healing, for example after organ transplantation, or in the treatment of myocardial 
infarction, but also in treating tumors, and in cancers, diabetic retinopathy, macular 
degeneration, psoriasis, and rheumatoid arthritis. For example, a cDNA encoding AAP 
may be useful in gene therapy, and AAP proteins may be useful when administered to a 
subject in need thereof. The novel nucleic acid encoding AAP, and the AAP proteins of 
the invention, or fragments thereof, may further be useful in diagnostic applications, 
wherein the presence or amount of the nucleic acid or the protein are to be assessed 
These materials are further useful in the generation of Abs that bind immunospecifically 
to the novel substances of the invention for use in therapeutic or diagnostic methods. 

Kelch-like protein (KLP) 

The putative protein encoded by KLP contains 1 putative BTB domain and 4 
putative Kelch motifs. The BTB (broad complex, tramtrack, bric-a-brac)/POZ (poxvirus, 
zinc finger) domain is involved in protein protein interactions. The kelch motif is sixfold 
tandem element in the sequence of the Drosophila kelch ORF1 protein that also contains 
BTB. Kelch ORF1 localizes to the ring canals in the egg chamber and helps to organize 
the F-actin cytoskeleton (Adams et al, 2000). The repeated kelch motifs predict a 
conserved tertiary structure, a p-propeller. This module appears in many different 
polypeptide contexts and contains multiple potential protein-protein interaction sites. 
Members of this growing superfamily are present throughout the cell and extracellularly 
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and have diverse activities (Adams et al, 2000). Such activities include cytoskeleton 
organization, as well as other morphological processes, gene expression, interactions with 
viruses, and various extracellular events, such as cell spreading. 

Ahgnment m&Drosophila kelch and other kelch-like proteins, human kelch-like 
protein (GenBank AAF20938 (SEQ ID NO:17)), hypothetical C. elegans (GenBank 
061795 (SEQ ID N0.18) and the skeletal muscle-specific sarcosin (GenBank 060662 
(SEQIDNO:19)- (Taylors/., 1998)) reveals that the disclosed protein (SEQ ID NO:2) 
is a member of a new subfamily 

KLP is associated with tube formation and angiogenesis because it is upregulated 
in the in vitro model of angiogenesis of Example 1. Kelch mediates cytoskeletal 
associations, it is involved in morphogenetic processes, such as tube formation, that 
depend on cytoskeletal arrangements and signaling. KLP represents an attractive target 
for small molecule drug therapy. 

Human ortholog of mouse BAZF (hBAZF) 

hBAZF (SEQ ID NO:4) is the human ortholog of mouse BAZF (GenBank 
AB011665; S EQmNO: 2 0),BAZFisaB^^ 

that controls germinal center formation and the T cell-dependent immune response 
Expression of Bcl-6 negatively correlates with cellular proliferation: Bcl-6 suppresses 
growth associated with impaired mitotic S phase progression and apoptosis (Albagli et 
al, 1999). ^ 

BAZF contains a BTB/POZ domain and five repeats of the Kruppel-like zinc 
fingermotifs, instead of6 in Bcl-6 (Okabe 1998). Expression of BAZF mRNA is 
relegated to heart and lung, unlike Bcl-6 mRNA, but is induced in activated lymphocytes 
as an immediate-early gene, like Bcl-6 (Okabe et al. , 1998). 

The hBAZF sequence was derived by using tblastn (protein query -translated 
database) (Altschul et al, 1997), with the mouse protein sequence (GenBank 088282; 
SEQ ID NO:21) that has homology to GenBank AC015918 (SEQ ID NO:22), a clone 'of 
Homosapiens chromosome 17. Human BAZF contains five Kruppel-like zinc finger 
motif repeats and a BTB/POZ domain. 

The peptide sequence, "RSQ. . .PQV" that is present in the human sequence might 
represent an alternative spliced form of the gene. Aligment with mouse BAZF, and 
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not as 



augment with mouse and human Bcl-6 demonstrates that the four proteins are almost 
identical in this region, but only human BAZF has this inserted sequence. 

hBAZF is unregulated in HUVE cells grown embedded in collagen gels but 
a monolayer grown on collagen. When HUVE cells are suspended in collagen, they do 
not proliferate. Analagous to the role of mBAZF plays a role in regulating cell 
proliferation (Okabe et a!., 1998), hBAZF plays a roll in cell proliferation in HUVE 
suspended in collagen. Because of its high expression during vessel morphogenesis, 
hBAZF represents an excellent molecular marker, as well as an attractive target for ' 
various therapies to inhibit angiogenesis. 



hmt-Elongation Factor G (hEF-G) 

The original isolation of hEF-G (SEQ ID NO:6) is 84% identical and colinear 
with Rattus norvegicus nuclear encoded mitochondrial elongation factor G (GenBank 
L14684 (SEQ ID NO:23); (Barker et al, 1993). No human gene j, described in 
GenBank. However, searching EST databases, the human gene is contained inside 
GenBank AC01 0936 (SEQ ID NO:24), a chromosome 3 clone. Aligment of hEF-G with 
rat mtEF-G and yeast EF-G1 demonstrates that the novel sequence is the ortholog of rat 
nuclear-encoded mitochondrial elongation factor G. 

Bacterial elongation factor G (EF-G) physically associates with translocation- 
competent ribosomes and facilitates transition to the subsequent codon through the 
coordinate binding and hydrolysis of GTP. The deduced amino acid sequence of hmt-EF- 
G reveals characteristic motifs shared by all GTP binding proteins. Therefore, similarly 
to other elongation factors, the enzymatic function of hmt-EF-G is predicted to depend on 
GTP binding and hydrolysis. 

Hmt-EF-G is strongly induced (30-fold) in an in vitro model of angiogenesis 
(Example 1), and as such, hmt-EF-G represents an excellent molecular marker for vessel 
formation. Because of its putative localization to the mitochondrion, hmt-EF-G is also an 
attractive therapeutic target to treat disease states associated with mitochondrial 
dysfunction. 



Human thyroid regulated transcript (hTRG) 
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hTRG (SEQ ID NO:8) is the human ortholog of rat TRG, a novel thyroid 
transcript negatively regulated by TSH (GenBank KIAA1058 (SEQ ID NO:25); 
(Bonapace et ai, 1990). 

SEQ ID NO:25 appears to be a partial peptide since there are C. elegans 
homologous proteins of 2000 residues. Using tblastn (Altschul et ai, 1997) against 
genomic sequences, the hTRG sequence (SEQ ID NO:8) was assembled. 

In C. elegans, homologous proteins localize either to the plasma membrane or to 
the mitochondrial inner membrane. A partial sequence, KIAA0694 (SEQ ID NO:26) also 
localizes to the mitochondrial matrix. hTRG has a PH domain, and has weak homology 
to an extracellular fibronectin-binding protein precursor. SEQ ID NO:26 has homology 
to Drosophila DOS and mouse Gab-2 proteins; both of which are involved in signal 
transduction, acting as adapter proteins between receptors and kinases like Rasl (Hibi and 
Hirano, 2000). 

Because of hTRG is upregulated during the in vitro model of angiogenesis 
(Example 1), and because of its homologies with adapter proteins, hTRG is likely to be 
involved in signal transduction between receptors and kinases. As such, hTRG represent 
an excellent candidate for small molecule drug therapy to modulate angiogenesis and treat 
angiogenesis-related diseases. In addition, because of its putative ability to respond to 
thyroid stimulating hormone (TSH), modulation of hTRG is useful to treat diseases 
related to TSH imbalance. 

Human myosinX(hMXl(SEQIDNO:10) and hMX2 (SEQ ID NO. J2) 
The hMX proteins represent the human ortholog of bovine myosin X, (GenBank 
AAB39486; SEQ ID NO:27). Using tblastn (Altschul et ai, 1997) and the bovine 
sequence, a series of genomic clones from human chromosome 5 were identified; 
GenBank AC010310 (SEQ ID NO:28) appears to contain the entire sequence. 
Interestingly, a partial cDNA sequence from mouse (GenBank AF1 84153; SEQ ID 
NO:29) localizes to a 0.8 cM interval on the short arm of chromosome 5, between the 
polymorphic microsatellite markers D5S416 and D5S21 14. In this region lies the gene 
for familial chondrocalcinosis (CCAL2) (Rojas et al., 1999). 

Another GenBank entry, AB018342 (SEQ ID NO:30) that represents the 3' region 
of hMX, appears to encode an alternative splice form. Noteworthy, this variant (hMX2) 



WO 01/70808 



-61 - 



PCT/USOl/09609 



has a very hydrophobic carboxy terminus, while the more prevalent form (hMXl) i s 
hydrophilic and potentially interacts with DNA/RNA since it has homology to high 
mobility group box (HMG) and ribosomal proteins. Additionaly, a myosin head domain 
was found in the NH terminus, as well as a myosin talin domain, two calmodulin binding 
domains, four pleckstrin domains and a band 4.1 domain. 

The band 4. 1 domain represents a crossroads between cytoskeletal organization 
and signal transduction. The domain was first described in the red blood cell protein band 
4.1. The ERM proteins ezrin, radixin, and moesin and the unconventional myosins Vila 
and Xall possess the band 4.1 domain (Louvet-Vallee, 2000). Theband4.1 domain 
binds single transmembrane protein at the membrane-proximal region in the C-terminal 
cytoplasmic tail. 

HMX is upregulated during angiogenesis in an in vitro model (Example 1). 
Because hMX contains the protein-protein interaction domains PH and band 4. 1 domain, 
hMXl and hMX2 are involved in angiogenesis, likely transducing signals from 
angiogenic factors, perhaps modulating the cytoskeleton. 

Human mitochondrial protein (hMP) 

Analysis of hMP (SEQ ID NO: 14) reveals several subdomain that are homologous 
to proteins involved in transport across membranes, K + ATPase a and y chains. Further 
analysis indicates that hMP may bind DNA and or RNA, since hMP is homologous to 
histones and transcription factors, especially those possessing basic region plus leucine 

zipper domains. 

Although PSORT analysis (Nakai and Horton, 1999) predicts nuclear localization 
(P-.6), hMP may in fact be a nuclear-encoded mitochondrial protein. Homologies with 
mostly bacterial proteins and a PSORT prediction ofinitochondrial matrix space 
localization (P=0.4478) strongly support this contention. 

Because hMP is upregulated in an in vitro model of angiogenesis (Example 1), 
and because of its homologies with mitochondrial and nuclear-localized polypeptides,' 
hMP is important in vascular morphogenesis, most likely through either powering the' 
cellular differentiation-redifferentiation process, and/or affecting changes in the nuclear 
matrix that change global gene expression. Alternatively, hMP may be a transcription 
factor for either the nuclear or mitochondrial genomes. 
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Nuclear hormone receptor (NHR) 

NHRCSEQlDNO^QhastwodomainsrmtheNHrerionic - -, 
fveast SWT/qwtt , , UjmeiMH reborns similar to Swi3 

2^~ reguhli „^ (2)tecoOH ^ onis 

I999,p re d 1 c tsUlepr „ letatoIo(:afeintheii|)claisiM)96oo "»«»'>. 

The identification of this new nutatiw hnrm^ a . 

and h^ATv n ewputanve hormone receptor-transcriptional regulator 



^).™anSnaA,andlus to nedeace W ase<HDAC). hBAZF, „nich abo 

tabular structures (Lansinfc a a/., I09 8) . P ^ 

seq^estemig hormones and suppressing transcription. Because nHR is down-regulated 
^.ymurenpiestaaredirec^atprever^gvascuiarWoI ' 



AAP polynucleotides 



AtforbtologtcaUy-acaveporao^aereof Abo included in the invention are „„ , 
^>™^tfcauonaod,or m u<^ 



WO 01/70808 



-63- 



PCT/US01/09609 



molecule" includes DNA molecules {e.g., cDNA or genomic DNA), RNA molecules 
(e.g., mRNA), analogs of the DNA or RNA generated using nucleotide analogs and 
denvahves, fragments and homologs. The nucleic acid molecule may be single-stranded 
or double-stranded, but preferably comprises double-stranded DNA. 
1- probes 

Probes are nucleic acid sequences of variable length, preferably between at least 
^outlOnucleonde S (nt)^^^ 

Probes are used to detect identical, similar, or complementary nucleic acid sequences ' 
Longer length probes can be obtained from a natural or recombinant source, are highly 
^ecxfic, and much slower to hybridize than shorter-length oligomer probes. Probes may 
be .angle- or double-stranded and designed to have specificity in PGR, membrane-based 
hybruization technologies, or ELISA-like technologies. Probes are substantially purified 

50,100, 150,200,250,300,350 or 400 consecutive sense strand nucleotide sequenced ' 
SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, or 15; or an anti-sense strand nucleotide sequence of 
these sequences; or of a naturally occurring mutant of these sequences. 

^^P^^gfrnativesequ^ 
(homologous) sequences (Ausubel * al, 1987; Sambrook, 1989), such as: (1) full-length 

fehne, canme, bacterial, viral, retroviral, yeast), (2) from cells or tissues, (3) variants 
wxthm a species, and (4) homologues and variants from other species. To find related 
sequences that may encode related genes, the probe may be designed to encode unique 
sequences or degenerate sequences. Sequencesmay also be genomic sequences including 
promoters, enhancer elements and introns of native sequence AAP. 

For example, an AAP coding region in another species may be isolated using such 
probes. Aprobeofabout40basesisde S igned,basedonan^>,andmad, Todetect 
hybndxzations, probes are labeled using, for example, radionuclides such as 32 P or «S or 
enzymatic labels such as alkaline phosphatase coupled to the probe via avidin-biotin ' 
systems. Labeled probes are used to detect nucleic acids having a complementary 
sequence to that of an AAP in libraries of cDNA, genomic DNA or mRNA of a desired 
species. 
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Such probes can be used as a part of a diagnostic test kit for identifying cells or 
tissues which mis-express an AAP, such as by measuring a level of an AAP in a sample of 
cells from a subject e.g., detecting AAP mRNA levels or deterniining whether a genomic 
AAP has been mutated or deleted. 

2. isolated nucleic acid 

An isolated nucleic acid molecule is separated from other nucleic acid molecules ' 
that are present in the natural source of the nucleic acid. Preferably, an isolated nucleic 
acid is free of sequences that naturally flank the nucleic acid (/.*, sequences located at the 
5'- and 3'-tennini of the nucleic acid) in the genomic DNA of the organism from which 
the nucleic acid is derived. For example, in various embodiments, isolated AAP 
molecules can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of 
nucleotide sequences which naturally flank the nucleic acid molecule in genomic DNA of 
the cell/tissue from which the nucleic acid is derived (e.g., brain, heart, liver, spleen, etc.). 
Moreover, an isolated nucleic acid molecule, such as a cDNA molecule, can be 
substantially free of other cellular material or culture medium when produced by 
recombinant techniques, or of chemical precursors or other chemicals when chemically 
synthesized. 

A nucleic acid molecule of the invention, e.g., a nucleic acid molecule having the 
nucleotide sequence of SEQ ID NOSrl, 3, 5, 7, 9, 1 1, 13 or 15, or a complement ofthis 
aforementioned nucleotide sequence, can be isolated using standard molecular biology 
techniques and the provided sequence informatioa Using all or a portion of the nucleic 
acid sequence of SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13 or 15 as a hybridization probe, AAP 
molecules can be isolated using standard hybridization and cloning techniques (Ausubel 
etal, 1987; Sambrook, 1989). 

PCR amplification techniques can be used to amplify^ using cDNA, mRNA 
or alternatively, genomic DNA, as a template and appropriate oligonucleotide primers. 
Such nucleic acids can be cloned into an appropriate vector and characterized by DNA 
sequence analysis. Furthermore, oligonucleotides corresponding to AAP sequences can 
be prepared by standard synthetic techniques, e.g., an automated DNA synthesizer. 

3. oligonucleotide 

An oligonucleotide comprises a series of linked nucleotide residues, which 
oligonucleotide has a sufficient number of nucleotide bases to be used in a PCR reaction 
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or other applicant A short oligonucleotide sequence may be based on, or designed 
from, a genomic or cDNA sequence and is used to amplify, confirm, or reveal the 
presence of an identical, similar or complementary DNA orRNAina particular cell or 
tssue. Oligonucleotides comprise portions ofa nucleic acid sequence having about 10 nt, 
50 nUrlOOnt in length, preferably about 15 nt to 30 nt in length. In one embodiment of 
the nxvenuon, an oligonucleotide comprising a nucleic acid molecule less than 1 00 nt in 
length would further comprise at least 6 contiguous nucleotides ofSEQlDNOSl 3 5 
7.^U3orl5,oracomplementthereof. Oligonucleotides may be chemically ' 
synthesized and may also be used as probes. 

4. complementary nucleic acid sequences; binding 
In another embodiment, an isolated nucleic acid molecule of the invention 
compnsesanucl^ 

m SEQ ID NOS: 1, 3,-5, 7, 9, 1 1, 13 or 15, or a portion of this nucleotide sequence (eg a 
fragmentfcatca.beused^^ ' 

active poruonofanAAP). Anucleic acid molecule that is complementary to the 
nucleot.de sequence shown in SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13 or 15, is one that is 
suffic,ently complements^ 3 $ ? 

9, 11, 13or 15, that it canhydrogen bond with little or no mismatches to the nucleotide ' 

sequence shown in SEQ ID NOS- 1 3 5 7fln iwua t. , 

v J, 3, ^9, U, 13 or 15, mereby forming a stable 

duplex. 

"Complementary" refers to Watson-Crick or Hoogsteen base pairing between 
nucleotides units of a nucleic acid molecule, and the term "binding" means the physical 
or chemical interaction between two polypeptides or compounds or associated 
polypeptides or compounds or combinations thereof Binding includes ionic, non-ionic 
vanderWaals,hydrophobicmteractions,andthelilc & A physical interaction can be ' 
ather direct or indirect Indirect interactions may be through or due to the effects of 
anotherpolypeptideorcompound. Direct binding refers to interactions that do not take 
Place through, or due to, the effect ofanother polypeptide or compound, but instead are 
without other substantial chemical intermediates. 

Nucleic acid fragments are at least 6 (contiguous) nucleic acids or at least 4 
(contiguous) amino acids, a length sufficient to allow for specific hybridization in the 
caseofnucleic acids or for specific recognition of an epitope m me case of amino acids 
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respectively, and are at most some portion less than a full-length sequence. Fragments 
may be derived from any contiguous portion of a nucleic acid or amino acid sequence of 
choice. 

5. derivatives, and analogs 

Derivatives are nucleic acid sequences or amino acid sequences formed from the 
native compounds either directly or by modification or partial substitution. Analogs are 
nucleic acid sequences or arnino acid sequences that have a structure similar to, but not 
identical to, the native compound but differ from it in respect to certain components or 
side chains. Analogs may be synthetic or from a different evolutionary origin and may 
have a similar or opposite metabolic activity compared to wild type. Homologs are 
nucleic acid sequences or amino acid sequences of a particular gene that are derived from 
different species. 

Derivatives and analogs maybe full length or other than full length, if the 
derivative or analog contains a modified nucleic acid or amino acid, as described below. 
Derivatives or analogs of the nucleic acids or proteins of the invention include, but are not 
limited to, molecules comprising regions that are substantially homologous to the nucleic 
acids or proteins of the invention, in various embodiments, by at least about 70%, 80%, or 
95% identity (with a preferred identity of 80-95%) over a nucleic acid or amino acid 
sequence of identical size or when compared to an aligned sequence in which the 
alignment is done by a computer homology program known in the art, or whose encoding 
nucleic acid is capable of hybridizing to the complement of a sequence encoding the 
aforementioned proteins under stringent, moderately stringent, or low stringent conditions 
(Ausubelera/., 1987). 

6. homology 

A "homologous nucleic acid sequence" or "homologous amino acid sequence," or 
variations thereof, refer to sequences characterized by homology at the nucleotide level or 
amino acid level as discussed above. Homologous nucleotide sequences encode those 
sequences coding for isoforms of AAP. Isoforms can be expressed in different tissues of 
the same organism as a result of, for example, alternative splicing of RNA Alternatively, 
different genes can encode isoforms. In the invention, homologous nucleotide sequences 
include nucleotide sequences encoding for an AAP of species other than humans, 
including, but not limited to: vertebrates, and thus can include, e.g., frog, mouse, rat, 
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rabbit, dog, cat cow, horse, and other organisms. Homologous nucleotide sequences also 
include, but are not limited to, naturally occurring allelic variations and mutations of the 
nucleotide sequences set forth herein. A homologous nucleotide sequence does not, 
however, include the exact nucleotide sequence encoding human AAP. Homologous 
nucleic acid sequences include those nucleic acid sequences that encode conservative 
amino acid substitutions (see below) in SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16, as well 
as a polypeptide possessing AAP biological activity. Various biological activities of the 
AAP are described below. 

7. open reading frames 

The open reading frame (ORF) of an AAP gene encodes an AAP. An ORF is a 
nucleotide sequence that has a start codon (ATG) and terminates with one of the three 
"stop" codons (TAA, TAG, or TGA). In this invention, however, an ORF may be any 
part of a coding sequence thatmay or may not comprise a start codon and a stop codon. 
To achieve a unique sequence, preferable AAP ORFs encode at least 50 amino acids. 

AAP polypeptides 

1- mature 

An AAP can encode a mature AAP. A "mature" form of a polypeptide or protein 
disclosed in the present invention is the product of a naturally occurring polypeptide or 
precursor form or proprotein. The naturally occurring polypeptide, precursor or 
proprotein includes, by way of nonlimiting example, the full-length gene product, 
encoded by the corresponding gene. Alternatively, it may be defined as the polypeptide 
precursor or proprotein encoded by an open reading frame described herein. The product 
"mature" form arises, again by way of nonlimiting example, as a result of one or more 
naturally occurring processing steps as they may take place within the cell, or host cell in 
which the gene product arises. Examples of such processing steps leading to a "mature- 
form of a polypeptide or protein include the cleavage of the N-terminal methionine 
residue encoded by the initiation codon of an open reading frame, or the proteolytic 
cleavage of a signal peptide or leader sequence. Thus a mature form arising from a 
precursor polypeptide or protein that has residues 1 to N, where residue 1 is the N- 
terminal methionine, would have residues 2 through N remaining after removal of the N- 
terminal methionine. Alternatively, a mature form arising from a precursor polypeptide 
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orprotein having residues 1 to N,in which an N-tenninal signal sequence from residue 1 
to residue M is cleaved, would have the residues from residue M+l to residue N 
remans Further as used herein, a "mature" form of a polypeptide or protein may arise 
from a step of post-translational modification other than a proteolytic cleavage event 
Such additional processes include, by way of non-limiting example, glycosylate 
-ynstoylation or phosphorylation, m general, a matu. polypeptide or protein may resnlt 
from the operation of only one of these processes, or a combination of any of them. 
2. active 

An active AAP polypeptide or AAP polypeptide fragment retains a biological 
and/or an immunological activity similar, but not necessarily identical, to an activity of a 
naturaUy-oc^ 

particular biological assay.with or without dose dependency, can be used to determine 
AAP activity. A nucleic acid fragment encoding a biologically-active portion of AAP can 
bepreparedbyisolating aportionof SEQIDNOS:!, 3, 5, 7, 9, 11, 13 or 15 that encodes 
a polypeptide having an AAP biological activity (the biological activities of the AAP are 
described below), expressing the encoded portion of AAP (e. g ., by recombinant 
expression in vitro) and assessing the activity of the encoded portion of AAP 
Immunological activity refers to the ability to induce the production of an antibody 
against an antigenic epitope possessed by a native AAP; biological activity refers to a 
function, either inhibitory or stimulatory, caused by a native AAP that excludes 
immunological activity. 

AAP nucleic acid variants and hybridization 

1. variant polynucleotides, genes and recombinant genes He invention 
further encompasses nucleic acid molecules that differ from the nucleotide sequences 
shown in SEQ ID NOS.l, 3, 5, 7, 9, 1 1, 13 or 15 due to degeneracy of the genetic code 
and thus encode the same AAP as that encoded by the nucleotide sequences shown in 
SEQ ID NO NOS:l, 3, 5, 7, 9, 1 1, 13 or 15. An isolated nucleic acid molecule of the 
mvention has a nucleotide sequence encoding a protein having an amino acid sequence 
shown in SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16. 

Inadditiontothe^sequencesshowninSEQIDNOS:!^^ 7 9 H 13 or 
15, DNA sequence polymorphisms that change the amino acid sequence of die AAP may 
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exist within a population. For example, allelic variation among individuals will exhibit 
genetic polymorphism in an AAP. The terms "gene" and "recombinant gene" refer to 
nucleic acid molecules comprising an open reading frame (ORF) encoding an AAP, 
preferably a vertebrate AAP. Such natural allelic variations can typically result in 1-5% 
variance in an AAP. Any and all such nucleotide variations and resulting amino acid 
polymorphisms in an AAP, which are the result of natural allelic variation and that do not 
alter the functional activity of an AAP are within the scope of the invention. 

Moreover, AAP from other species that have a nucleotide sequence that differs 
from the human sequence of SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13 or 15, are contemplated. 
Nucleic acid molecules corresponding to natural allelic variants and homologues of an 
AAP cDNAs of the invention can be isolated based on their homology torn AAP of SEQ 
IDNOSrl, 3, 5, 7, 9, 11, 13 or 15 using cDNA-derived probes to hybridize to 
homologous AAP sequences under stringent conditions. 

"AAP variant polynucleotide" or "AAP variant nucleic acid sequence" means a 
nucleic acid molecule which encodes an active AAP that (1) has at least about 80% 
nucleic acid sequence identity with a nucleotide acid sequence encoding a full-length 
native AAP, (2) a full-length native AAP lacking the signal peptide, (3) an extracellular 
domain of an AAP, with or without the signal peptide, or (4) any other fragment of a full- 
length AAP. Ordinarily, an AAP variant polynucleotide will have at least about 80% 
nucleic acid sequence identity, more preferably at least about 81%, 82%, 83%, 84% 
85%, 860/c, 87o/„, 88o/„, 89o/ 0 , 90o/ o , 91%, 9 2 o/o, 93%, 94%, 95%, 96o/ 0 , 97%, 98% nudeic 
acid sequence identity and yet more preferably at least about 99% nucleic acid sequence 
identity with the nucleic acid sequence encoding a full-length native AAP. An AAP 
variant polynucleotide may encode a full-length native AAP lacking the signal peptide, an 
extracellular domain of an AAP, with or without the signal sequence, or any other 
fragment of a full-length AAP. Variants do not encompass the native nucleotide 
sequence. 

Ordinarily, AAP variant polynucleotides are at least about 30 nucleotides in 
length, often at least about 60, 90, 120, 150, 180, 210, 240, 270, 300, 450, 600 
nucleotides in length, more often at least about 900 nucleotides in length, or more. 

"Percent (%) nucleic acid sequence identity" with respect to AAP-encoding 
nucleic acid sequences identified herein is defined as the percentage of nucleotides in a 
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canchdate sequence that are identical with the nucleotides in the AAP sequence of interest 
after ahgmng the sequences and introducing gaps, if necessary, to achieve the maximum ' 
percent sequence identity. Alignment for purposes of determining % nucleic acid 
sequence identity can be achieved in various ways that are within the skill in the art for 
mstance, using publicly available computer software such as BLAST, BLAST-2 ALIGN 
or MegalignCDNAST^software. Those skilled in the art can determine appropriate 
parameters for measuring alignment, including any algorithms needed to achieve 
maximal alignment over the full length of the sequences being compared. 

When nucleotide sequences are aligned, the % nucleic acid sequence id ^ ty q{& 
given nucleic acid sequence C to, with, or against a given nucleic acid sequence D (which 
can alternatively be phrased as a given nucleic acid sequence C that has or comprises a 
certam % nucleic acid sequence identity to, with, or against a given nucleic acid sequence 
D) can be calculated as follows: 

%nucleic acid sequence identity = W/Z ' 100 
where 

Wismen ^berofnucleoadescoredasidenticalmatchesb y the S equence 
aUgnment program's or algorithm's alignment of C and D 

and 

Z is the total number of nucleotides in D. 

When the length of nucleic acid sequence C is not equal to the length of nucleic 
ac,d sequence D, the % nucleic acid sequence identity of C to D will not equal me % 
nucleic acid sequence identity of D to C. 

2. Stringency 

Homologs (i.e., nucleic acids encoding an AAP derived from species other than 
human) or other related sequences (e. g ., paralogs) can be obtained by low, moderate 
high stringency hybridization with all or a portion of the particular human sequence 
probe using methods well known in the art for nucleic acid hybridization and cloning 

The specificity of single stranded DNA to hybridize complementary fragments is 
determined by the "stringency" of the reaction conditions. Hybridization stringency 
increases as the propensity to form DNA duplexes decreases. In nucleic acid 
hybridization reactions, the stringency can be chosen to either favor specific 
hybridizations (high stringency), which can be used to identify, for example, fulWength 



or 
as a 
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clones from a library. Less-specific hybridizations (low stringency) can be used to 
identify related, but not exact, DNA molecules (homologous, but not identical) or 
segments. 

DNA duplexes are stabilized by: (1) the number of complementary base pairs, (2) 
the type of base pairs, (3) salt concentration (ionic strength) of the reaction mixture, (4) 
the temperature of the reaction, and (5) the presence of certain organic solvents, such as 
formamide which decreases DNA duplex stability. In general, the longer the probe, the 
higher the temperature required for proper annealing. A common approach is to vary the 
temperature: higher relative temperatures result in more stringent reaction conditions. 
(Ausubel et al, 1987) provide an excellent explanation of stringency of hybridization 
reactions. 

To hybridize under "stringent conditions" describes hybridization protocols in 
which nucleotide sequences at least 60% homologous to each other remain hybridized. 
Generally, stringent conditions are selected to be about 5°C lower than the thermal 
melting point (Tm) for the specific sequence at a defined ionic strength and/7H TheTm 
is the temperature (under defined ionic strength,/,!! and nucleic acid concentration) at 
which 50% of the probes complementary to the target sequence hybridize to the target 
sequence at equilibrium. Since the target sequences are generally present at excess, at 
Tm, 50% of the probes are occupied at equilibrium, 
(a) high stringency 

"Stringent hybridization conditions" conditions enable a probe, primer or 
oligonucleotide to hybridize only to its target sequence. Stringent conditions are 
sequence-dependent and will differ. Stringent conditions comprise: (1) low ionic strength 
and high temperature washes (e.g. 15 mM sodium chloride, 1.5 mM sodium citrate, 0.1 % 
sodium dodecyl sulfate at 50°C); (2) a denaturing agent during hybridization (e.g. 50% 
(v/v) formamide, 0.1% bovine serum albumin, 0.1% FicolL 0.1% polyvinylpyrrolidone, 
50mM sodium phosphate buffer (pH 6.5; 750 mM sodium chloride, 75 mM sodium 
citrate at 42°Q; or (3) 50% formamide. Washes typically also comprise 5X SSC (0.75 M 
NaCL 75 mM sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium 
pyrophosphate, 5 x Denhardt's solution, sonicated salmon sperm DNA (50 ug/ml), 0.1% 
SDS, and 10% dextran sulfate at 42°C, with washes at 42°C in 0.2 x SSC (sodium 
chloride/sodium citrate) and 50% formamide at 55«C, followed by a high-stringency wash 
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consisting of 0.1 x SSC containing EDTA at 55°C. Preferably, Reconditions axe such 
that sequences at least about 65%, 70%, 75%, 85%, 90%, 95%, 98%, or 99% homologous 
to each other typically remain hybridized to each other. These conditions arepresented as 
examples and are not meant to be limiting, 
(b) moderate stringency 

"Moderately stringent conditions" use washing solutions and hybridization 
concerns that are less stringent (Sambrook, 1989), such that a polynucleotide will 
hybridize to the entire, fragments, derivatives or analogs of SEQ ID NOS:l, 3, 5 7 9 „ 
13 or 15. One example comprises hybridization in 6X SSC, 5X Denhardfs'soiution, 0 5% 
SDS and 100 mg/ml denatured salmon sperm DNA at 55°C, followed by one or more 
washesinlXSSC,0.1%SDSat37°C. The temperature, ionic strength, etc., can be 
adjusted to accommodate experimental factors such as probe length. Other moderate 
stringency conditions are described in (Ausubel et al, 1987; Kriegler, 1990). 
(c) low stringency 

"Low stringent conditions" use washing solutions and hybridization conditions 
that are less stringent than those for moderate stringency (Sambrook, 1989), such that a 
polynucleotide will hybridize to the entire, fragments, derivatives or analogs of SEQ ID 
NOS:l,3, 5, 7,9, 11, 13 or 15. A non-limiting example of low stringency hybridization 
conditions are hybridization in 35% formamide, 5X SSC, 50 mM Tris-HCl (pH 7 5) 5 
mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100 mg/ml denatured salmon sperm 
DNA, 10% (wt/vol) dextran sulfate at 40°C, followed by one or more washes in 2X SSC 
25 mMTris-HCl 0>H 7.4), 5 mMEDTA, and 0.1% SDS at 50<>C. Other conditions of low 
stringency, such as those for cross-species hybridizations are described in (Ausubel et al 
1987; Kriegler, 1990; Shilo and Weinberg, 1981). 
3. Conservative mutations 

hi addition to naturaUy-occurring allelic variants of AAP, changes can be 
introducedbymutationintoSEQIDNONOS:l,3,5,7,9, 11, 13 or 15 sequences that 
incur alterations in the amino acid sequences of the encoded AAP that do not alter the 
AAP functioa For example, nucleotide substitutions leading to amino acid substitutions 
at "non-essential" amino acid residues can be made in the sequence of SEQ ID NOS 2 4 
6, 8, 10, 12, 14 or 16. A "non-essential" amino acid residue is a residue that can be 
altered from the wild-type sequences of the AAP without altering their biological activity 
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whereas an .essential" amino acid residue is required for such bioiogical activity. For 
example^ amino acid residues tha, are c^rv* among to ^ of fte invention 
predtcted ,o be particularly non-amenable to alteration. Ammo acids for which 
conservative substitutions can be made are well-known in the art 

Useful conservative substitutions an, shown in Table A, "Preferred substitutions ■ 
Conservative ~o-wh m b y an arnino acid ofone class is Macedwrtttarrotirer' 

polypeptide's biological activity. 



Table A Preferred substitutions 



____priginal residue 
Ala (A) 
ArgfR) 
Asn(N) 
Asp(D) 


Exemplary substitutions 

Val, Leu. He 
Lys, tiln, Asn 
Gin, His, Lvs. Arg 


_ .rteterred substitutions 
Val 
Lys 
Gin 


Cys CC) 

Gin CO) 
Glu(E) 


Glu 
Ser 
Asn 

Pro. Ala 

Asn, Gin. Lys, Arg 


Glu 
Ser 
Asn 

Asg 


His (H) 


Ala 

Arg 


lie (I) 


Leu, Val, Met, Ala, Phe, 
Norleucine 


Leu 


Leu(L) 

LysfK) 
Met(M) 
Phe (F) 
Pro an 


Norleucine, De, Val, Met, Ala, 
Phe 
Are, Ciln, Asn 
■Leu, Fiie, He 
Leu, Val. lie. Ala, Tyr 
Ala 


lie 

Arg 
Leu 
Leu 
Ala 


Ser(S) 
Thrm 
Trp(W) 
TyrfY) 

Val(V) 


Thr 
Ser 
Tvr.Phe 
Trp. Phe, Thr, Ser 
lie, i^u, Met, Phe, Ala, 
Norleucine 


Thr 
Ser 

Tyr 

Phe 

Leu 



WO 01/70808 



-74- 



PCT/US01/09609 



backbone, such as a p-shee, o, a-helical contain, (2) fte charge or (3) 

PXmtM or idaltjty ResiduK m 

—on . dmoted fa Tab , e R ^ 



sites 



Table B Amino acid classes 



Class 



hydrophobic 



_ neutral hydrop hilifi 



acidic 



basic 



disrupt chain conformation 



aromatic 



Amino acids 



^TieucineJMet^^ 

ivs, Ser, Thr ' 



Asp, Glu 



Asn, Gin, His, Lys. Are 



Gly, Pro 



Trp,Tyr,Phe 



genesis. Site^ected mutagenesis (Carter, 1986; Zoller and Smith, 1987), cassette 

-bmquescanbeperformedonmeclonedDNAtoproduceme 
(Ausubel et al, 1987; Sambrook, 1989). 

about95/„homologoustoSEQlDNOS:l,3,5,7,9, 11, 13 >or 15 

A mutant AAP can be assayed for blocking augic^s * vitra 

4. Anti-sense nucleic acids 

Using annserae and sense AAP oligonndeoddes can prevent AAP po lyp »nde 
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other means. ^ y 

Hoogsteen base pairing rules The anti-sense nucleic acid molecule can be 
complex to the mto coding region of an ^ but more p 

ontyapomonoftacodingonnonco^g^onofan^mRNAPo,^,;^ 
2-eo.„ d _^ — ^^^^^ ^ 

fegn.en.ofu.eAAPDKAcoding^o^^,^,,^ 

engmormore. Among ofcers, (S,em and Cohen, l^vanderKroL,.,, I988a) 
descnbe meftods to derive antisense or a sense oligonucleotides ten a given cDNA 

sequence. ^ 

nucletc actd tnclude: 5-fluorouracil, 5-bromou.aci,, ^ 5 . iodouracjl 
hypoxanthine, xanthine, 4-acayIcyteine, SKcarboxyhydroxylmclhyl) uracil 5-' 
crtoxymemylanunome^^ 

diydrouradl.b.ta.D-galac.osylcueoame.mosmc.NS^ ' 
methylguanine, l-mett-ylir^e, 2^efl,ylguamne, 2-memyiadenine 2- 

3-methy.cytosine, S.memy.cytosine, ^adenine, T-mefcyiguanine, 5 . 
melhylammomemyluraoil, 5^eu W xyaminome t hyl-2- t hiouracil, beta-D- 
j—aueosinO'-meta^^ 

d-ytosme, S-meW-thiouraci,, 2-thiouracil, 4-tniouracil, 5-memyiuracH uracil-5-' 
oxyaceuc acid me.hyles.er, uraciM-oxyacetic acid (v), ^memyl-2.,hi„uracil H3 - 

fenced RNA wiu be complement «„ a targe, nucieic acid of interest 
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To introduce antisense or ^ oIigomioleolid( , ^ ^ ^ ^ ^ 
the targe, nucleic acid sequence), any gene transfer mediod may be ^ ExamplK of 
gene to* memoes include (1) biological, such as gene Wer vectors like Epstein- 
Bar, vnus or conjugating the exogenous DNA to a ligand-binding molecule, (2) physical 
such as electroporation and injection, and (3, chemical, such as CaP0 4 precipitation^ 
oligonucleotide-lipid complexes. 

An antisense or sense oligonucleotide is inserted into a suitable gere transfer 
retroviral vector. A cell contaimng the target nucleic acid sequence is contacted with the 
recombinant retrovind vector, either* i™, or axviVo. Examples of suiMcrenovnal 
vectors include those derived from me murine retrovirus M-MuLV, N2 (a retrovirus 
derived from M-MuLV), or me double copy vectors designated DCT5A, DCT5B and 
DCT5C (WO 90/13641, 1990). To achieve sufficient nucleic acid molecule 
transcription, vector construe* in which the transcript™ of the anti-sense nucleic acid 
molecule is controlled by a strong pol D or pol in promoter are preferred 

To specify target cells in a mixed population of cells cell surface receptors that are 
specie to me target cells can be exploited Antisense and sense oligonucleotides a* 
con J ugated,oaIigand-bir^ gm olecule,asdescrib^m(WO91/04753, 1 9o 1) Ligmds 
- arccbosenfcrreceptots that are specific to the targe, cells. Examples of, uitab,e ligaud- 
btndtng molecules include cell surface receptors, growti, factors, cytokines, or other 

hgand-bmdmgmolecmeaoesnotsubsnmtiallyinterferewimtheabffltyofm^ 

antisense oligonucleotide or its conjugated version into the cell. 
■ Liposomes efficiently transfer sense or an antisense oligonucleotide to cells (WO 
90/10448, .990,. The sense or antisense oligonucleotide-lipid complex is preferably 
dissociated within the cell by an endogenous lipase. 

The anti-sense nucleic acid molecule of the invention may be ana-a„omeric 
nucleic acid molecule. An .«rto nucleic acid molecule forms specific double- 
stranded hybrids with complementary RNA in which, coutiaiy to me usual ounits the 

runparaUel to each other(Gautier « at., ,987). The anti-sense nucleic acid 

moIecmecanalsoconrnriseaZ'-o-methylribonucleotideanouee,^ i9 87a)ora 
chimeric RNA-DNA analogue (Inoue a al., 1987b). 
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I» one embodiment, an mti . sense „ ucleic acjd Qf fte tavenaon , s a 
cleavmg a smgle-stranded nucleic acid, such as a„ mRNA, K wMch ^ ^ , 

based -•»-«to«.-»™. rf ,^.« tt(tll>a!piD - " 

wh 1 ch te ^ leoM e S e qne „cc„f tll e rtres i t eisc„ nwlOTm ^ totenucMde 
.. -6 7 2, ,* 2; us . paten(No ^ 

RNA molecules (Battel and Szostak, 1993). 

^^■^exp.essioncante^^^ ^ 

cells<Helene, 1991;Helene«< 1992;Maher, 1992). -"^target 
Modifies offense and ^.lig..^^^^ 

91/ 0629, ,99.), urease ,„ vivo stabffity by conferring resistonce „ endogenous 

Wdo„ienes<WO90/10448, .^orpoly-^ysine. Omer attachment 
mc^^ m dn, gspedfidliKofaeoBgomcMdesfortteirtar ^ 

complexes or intercalating ellipticine) and alkylating agents 

moH . '7^* to ^ b ^^ ttl '-«>°"=<'f'hcn»c I eic acids can be 

ntr K ^ byaPSM "^^^^'o-l 
nucleobases are retained Tnenc^ral backbone ofPNAsaUows for specinc 

bytndrzanon toDNA and KNA under conditions of ,ow ionics*^ 
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WA oligomers can be performed using standard solid phase peptide synthesis protocols 

(Hyrup and Nielsen, 1996; Perry-O-Keefe et al, 1996). 

PNAs of an AAP can be used in therapeutic and diagnostic applications For 
example, PNAs can be used as anti-sense or antigene agents for sequence-specific 
modulation of gene expression by inducing transcription or translation arrest or inhibiting 
replication. AAP PNAs may also be used in the analysis of single base pair mutations 
(e.g., PNA directed PGR clamping; as artificial restriction enzymes when used in 
combination with other enzymes, e. g ., S, nucleases (Hyrup and Nielsen, 1996)- or as 
probes or primers for DNA sequence and hybridization (Hyrup and Nielsen, 1996- Perry- 
OKeefe etal, 1996). ' 

PNAs of an AAP can be modified to enhance their stability or cellular uptake 
Lipophilic or other helper groups may be attached to PNAs, PNA-DNA dimmers formed 
or the use ofliposomes or other drug delivery techniques. For example, PNA-DNA 
chimeras can be generated that may combine the advantageous properties of PNA and 
DNA Such chimeras allow DNA recognition enzymes {e.g., RNase H and DNA 
polymerases) to interact with the DNA portion while the PNA portion provides high 
bmdmg affinity and specificity. PNA-DNA chimeras can be linked using linkers of 
appropnate lengths selected in terms of base stacking, number of bonds between the 
nucleobases, and orientation (Hyrup and Nielsen, 1996). The synthesis of PNA-DNA 

Forexample a 

DNA chain can be synthesized on a solid support using standard phosphoramidite 
coupling chemistry, and modified nucleoside analogs, e. g ., 5'-(4-methoxytrityl)amino-5'- 
deoxy-thynudine phosphoramidite, can be used between the PNA and the 5' end of DNA 
(Fmn et al, 1996; Hyrup and Nielsen, 1996). PNA monomers are then coupled in a 
stepwise manner to produce a chimeric molecule with a 5' PNA segment and a 3' DNA 
segment (Finn etal, 1996). Alternatively, chimeric molecules can be synthesized witha 
5' DNA segment and a 3' PNA segment (Petersen et al, 1976). 

The oligonucleotide may include other appended groups such as peptides (e g. for 
targeting host cell receptors /„ vivo), or agents facilitating transport across the cell 
membrane (Lemaitre et al, 1 987; Letsinger et al. ,1989) or PCX Publication No 

W088/098lO)ormeblood-brainbarrier(e. ? .,PCTPubUcationNo.W089/10134).In 
addition, oligonucleotides can be modified with hybridization-triggered cleavage agents 
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(van derKrol etal, 1988b) or intercalating agents (Zon, 1988). He oligonucleotide may 
be conjugated to another molecule, e.g., a peptide, a hybridization triggered cross-linking 
agent, a transport agent, a hybridization-triggered cleavage agent, and the like. 

AAP polypeptides 

One aspect ofthe invention pertains to isolated AAP, and biologically-active 
portions derivatives, fragments, analogs or homologs thereof Also provided are 
polypeptide fragments suitable for use as irnmunogens to raise anti-AAP Abs In one 
embodiment, a native AAP can be isolated from cells or tissue sources by an appropriate 
purification scheme using standard protein purification techniques. In another 
embodiment, AAP are produced by recombinant DNA techniques. Alternative to 
recombinant expression, an AAP can be synthesized chemically using standard peptide 
synthesis techniques. 

1. Polypeptides 

An AAP polypeptide includes the amino acid sequence of an AAP whose 
sequences are provided in SEQ ID NOS.2, 4, 6, 8, 10, 12, 14or 16. The invention also 
mcludes a mutant or variant protein any of whose residues may be changed from the 
corresponding residues shown in SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16, while still 
encoding a protein that maintains its AAP activities and physiological functions, or a 
functional fragment thereof 

2. Variant AAP polypeptides 

In general, an AAP variant that preserves an AAP-like function and includes any 
vanant m which residues at a particular position in the sequence have been substituted by 
other ammo acids, and further includes the possibility of inserting an additional residue or 
resumes between two residues ofthe parent protein as well as the possibility of deleting 
one or more residues from the parent sequence. Any amino acid substitution, insertion, or 
deletion is encompassed by the invention. In favorable circumstances, the substitution is 
a conservative substitution as defined above. 

"AAP polypeptide variant" means an active AAP polypeptide having at least: (1) 
about 80% amino acid sequence identity with a full-length native sequence AAP 
polypeptide sequence, (2) an AAP polypeptide sequence lacking the signal peptide (3) an 
extracellular domain of an AAP polypeptide, with or without the signal peptide, or (4) 
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£ 82%, 83% , 84 % , 85%> 86%> 87% , 88% , 89%> 

^^^, 98 ^ OMi ^_ ara%mdmo!(pref ^ iyM j2 

2^" tm ^ M *>*»»>'>«-* Ordina^.AATvaria* 7 

^».^, n „ reoflenatIeastal)out 3 0)40i50i60>7M0i 
amino acids in length, or more. 

a* p.* o me seqirence idenn* Amino acid ^ence aiignmen, precede „ dccrmine 
perec 0t ,d e n, tyarcw e Uto0TOtottMeofsMtatt ^ 0tei ™- 

com pm er S „ (l wa resU cha S Bl AS T, B I ASK , ALIGN2orMegaligI1 

^.^.oangnpepnde.eauence.InoscMedinte^cande^e 
^"^^formea^ 

acmcve^u^^^^^^^^^^^^ 

When amino acid fences are aiigned, the % amino acid aeone^e idenS* of a 

/.ammo identic to , ^ or asain5t a ^ ^ ^ 
can be calculated as: 



'oamino acid sequence identity = XfY ' 100 
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where 



X is the number of amino acid residues scored as identical matches by the 
sequence alignment program's or algorithm's alignment of A and B 
and 

Y is the total number of amino acid residues in B. 

If the length of amino acid sequence A is not equal to the length of amino acid 
sequence B, the % amino acid sequence identity of A to B will not equal the % amino 
acid sequence identity of B to A. 

3. Isolated/purified polypeptides 

An "isolated" or "purified" polypeptide, protein or biologically active fragment is 
separated and/or recovered from a component of its natural environment Contaminant 
components include materials that would typically interfere with diagnostic or therapeutic 
uses for the polypeptide, and may include enzymes, hormones, and other proteinaceous or 
non-proteinaceous material, Preferably, the polypeptide is purified to a sufficient degree 
to obtam at least 1 5 residues of N-terminal or internal amino acid sequence To be 
substantmlly isolated, preparations having less than 30% by dry weight of non-AAP 
contaxmnating material (contaminants), more preferably less than 20%, 10% and most 
preferably less than 5% contaminants. An isolated, recombinantly-produced AAP or 
bxologzcaUy active portion is preferably substantially free of culture medium, culture 
mednun represents less than 20%, more preferably less than about 10* and most 
preferably less than about 5% of the volume of the AAP preparatioa Examples of 
contammants include cell debris, culture media, and substances used and produced during 
in vitro synthesis of an AAP. 

4. Biologically active 

Biologically active portions of an AAP include peptides comprising amino acid 
sequences sufficiently homologous to or derived from the amino acid sequences of an 

AAP(SEQmNOS:2,4,6,8,10,12,14orl6)thatmcludefeweraminoacidsthana 
full-length AAP, and exhibit at least one activity of an AAP. Biologically active portions 
compose a domain or motif with at least one activity of a native AAP. A biologically 
active portion of an AAP can be a polypeptide that is, for example, 10,25,50, 100 or 
more ammo acid residue, in length. Other biologically active portions, in which other 
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retfonsof theprotein are deleted, bvrecombimBt 
•valued for one or more of a, toctioMl „ f , ^ ^ 

m SEQ n> KOS :2 , 4, 6, 8, 10, 12, 14 o, 16, or subtfaatiaUy ^ t0 s „ 
^NOS.2,4.6,8, .0, ,2, 14or, S , y « toifle rs i na mi noac i dseauenced„e«ona to l 

6,8, 10, 12, '♦wlS.andrelainsthe&ncaonalactivityofnattveAAP. 

»AAP variarirmeans an active AAPhavinga. teas,: (1) ^ 8 0%amhK, acid 

k*. ^^^, ( 3)anex^^do m ,„ofanAAP,wi t nor witt o„,l 

A^var^^olndeanAAP^^^^^^ ^ 
deleteda^N-orC-^of^^^^^ ^ 

abou.81/„ ammo aoid S eoue„ce identity, m ore preferably a, lea,, about 82% 83% 84-/ 
-dse^^^^^^^^^^^^^ 

the signal peptide, or any other fragment of a lull-length AAP sequence. Ordinarily AAP 

or 300 amino acids in length, or more. 

"Peisent (•/„) ^ Kid fe ^ ^ 

— * sequences are ajigned and if n«essary, gaps are unreduced to achieve die 
■hesequenceidentity. Amino Kid sequence ahgnment procures to determine ercen. 
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identity are well known to those of skill in the art Often publicly available computer 
software such as BLAST, BLAST2, ALIGN2 or Megalign (DNASTAR) software is used 
to align peptide sequences. Those skilled in the art can determine appropriate parameters 
for measuring alignment, including any algorithms needed to achieve maximal alignment 
over the full length of the sequences being compared. 

When amino acid sequences are aligned, the % amino acid sequence identity of a 
given amino acid sequence A to, with, or against a given amino acid sequence B (which 
can alternatively be phrased as a given amino acid sequence A that has or comprises a 
certain % amino acid sequence identity to, with, or against a given amino acid sequence 
B) can be calculated as: 

%amino acid sequence identity = X/Y ' 100 
where 

X is the number of amino acid residues scored as identical matches by the 
sequence alignment program's or algorithm's alignment of A and B 
and 

Y is the total number of amino acid residues in B. 

If the length of amino acid sequence A is not equal to the length of amino acid 
sequence B, the % amino acid sequence identity of A to B will not equal the % amino 
acid sequence identity of B to A. 

6. Chimeric and fusion proteins 

Fusion polypeptides are useful in expression studies, cell-localization, bioassays, 
and AAP purification. An AAP "chimeric protein" or "fusion protein" comprises an AAP 
fused to a non-AAP polypeptide. A non-AAP polypeptide is not substantially 
homologous to an AAP (SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16). An AAP fusion 
protein may include any portion to an entire AAP, including any number of the 
biologically active portions. An AAP may be fused to the C-terminus of the GST 
(glutathione S-transferase) sequences. Such fusion proteins facilitate the purification of a 
recombinant AAP. In certain host cells, (e.g. mammalian), heterologous signal 



WO 01/70808 



PCT/USOl/09609 



-84- 



sequences fusions may ameliorate AAP expression and/or secretion. Additional 
exemplary fusions are presented in Table C. 

Other fusion partners can adapt an AAP therapeutically. Fusions with members of 
the nxununoglobulin (Ig) protein family are useful in therapies that inhibit an AAP ligand 
or substrate interactions, consequently suppressing an AAP-mediated signal transduction 
« vrvo. Such fusions, incorporated into pharmaceutical compositions, may be used to 
treat prohferative and differentiation disorders, as well as modulating cell survival An 
AAP-Ig fusion polypeptides can also be used as immunogens to produce an anti-AAP 
Abs m a subject, to purify AAP Iigands, and to screen for molecules that inhibit 
interactions of an AAP with other molecules. 

Fusion proteins can be easily created using recombinant methods. A nucleic acid 
encodmga.A^ca.be^ 

AAP NH 2 - or COO- -terminus, or internally. Fusion genes may also be synthesized by 
conventional techniques, including automated DNA synthesizers. PGR amplification 
usmg^chor primers that giverise to complementary overhangs between two consecutive 
gene fragments that can subsequently be annealed and reamplified to generate a chimeric 
genesequenceCAusubel*,^ 

available that facilitate sub-cloning an AAP in-frame to a fusion moiety. 



Table C 



Reporter 
Human growth 
hormone (hGH) 


in vitro 

Radioimmuno- 
assay 


in vivo 

none 


Notes 
Expensive, 
insensitive, 
narrow linear 
range. 


Reference . 
(Seldene/a/., 
1986) 


p-glucu- 
ronidase (GUS) 

Green 


Colorimetric, 
fluorescent, or 
chemi- 
luminescent 


colorimetric 
(histo-chemical 
staining with X- 
gluc) 


sensitive, 
broad linear 
range, non- 
iostopic. 


(Gallagher, 
1992) 


fluorescent 
protein (GFP) 
and related 
molecules (REP, 
BFP, AAP, etcA 


Fluorescent 


fluorescent 



can be used in 
live cells; 
resists photo- 
bleaching 


(Chalfiee/a/., 
1994) 
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Luciferase 
(firefly) 



I bioluminsecent 



Bio- 

• luminescent 



Chloramphenico Chromato 



al 

acetyltransferas 
e (CAT) 



graphy, 
differential 
extraction, 
fluorescent, or 
immunoassay 



p-galacto-sidase j colorimetric, 
fluorescence, 
chemi- 
luminscence 



Secrete alkaline 
phosphatase 
(SEAP) 



colorimetric, 
bioluminescent, 
chemi- 

luminescent 



colorimetric 
! (histochemical 
| staining with X- 
gal), bio- 
luminescent in 
(live cells 



protein is 
unstable, 
difficult to 
reproduce, 
signal is brief 



Expensive 
radioactive 
substrates, 
time- 
consuming, 
insensitive, 
narrow linear 
range 



sensitive, 
broad linear 
range; some 
cells have high 
endogenous 
activity 



Chem- 

iluminscence 

assay is 

sensitive and 

>road linear 
range; some 
cells have 
endogenouse 
alkaline 
phosphatase 
activity 



(de Wet etal., 
1987) 



(Gorman et al, 
1982) 



(Alam and 
Cook, 1990) 



(Bergere/a/., 
:988) 



Therapeutic applications ofAAP 

1 . Agonists and antagonists 

"Antagonist" includes any molecule that partially or fully blocks, inhibits or 
neutrals a biological activity of an endogenous AAP. Similarly, "agonist" includes any 
molecule that mimics a biological activity of an endogenous AAP. Molecules that can act 
as agomsts or antagonists include Abs or antibody fragments, fragments or variants of an 
endogenous AAP, peptides, antisense oligonucleotides, small organic molecules, etc. 

2. Identifying antagonists and agonists 

To assay for antagonists, an AAP is added to, or expressed in, a cell along with 
the compound to be screened for a particular activity. If the compound inhibits the 
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activity of interest in the presence of an AAP tW/» 

■f a at* ■ esence or an AAP, that compound is an antagonist to the AAP - 

if an AAP acnvity is enhanced, the compound is an agonist. 

(a) Specific examples o/potential antagonists and agonist 
Exampfes of md ^ ^ (J) ^ ^ ^ 

nucleic acid aptamers. v ; 

sol ^-WM,,^,,^,^ 

molecule,, if they enhance an AAP activity, are samples of agonists 
Almost any anfflxrfy tha, aSects an AAP's function . 

•rr^TT E ^° f ^~--powon al n:„:Lt 

ft^Absntay.ea.ntany^^^^^^^^ 
Humanized Abs are also contemplated. 

fore ^r raaVely ' aP0 '" Mmte8 °^ OTag0 * tmaybea ^^P-<«n 

X: b T2^ mbmSAAe ^ —LnmtatL 
AAP may be constttuttvely activated and may act as an agonist 

orDNA^^ AOrDNAMntoCteMtoeffetaVem «^^«NA 

^Mnple, the 5 ' coding portion of an AAP sequence is used to design 
an^ense R N A oUgo„nc I eo a deof ft omabo u ,,„ to4 „ teep ^ taI ^ A 0 ^ 
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mo.ecnle into AAP (antisense) (Cohen, ,989; Otao l99I > ^ 
^l M d tocmatobedeUv ^ t0MUs ^ tattemi . seiisei!NAMi)N 
bcexpr^.v.^.oMH,^,,^,,,^^ ^^^^^ 

-10 and +10 posttions of the target gene nucleotide sequence, are preferred. 

Mozymes are enzymatic RNA moiecules capable of catalyzing the specific 
deavageofRNA R^zynWac, by sequence-specific hybridation L 

To inhibit transcription, tiiple-helix nudeic acids tha, are singie-stranded and 

compr.aedeoxynucleotideaarenseW^onis,, These ohgonucleotides are desi™, 
^^^^^^^^ ^ 

requiring stretches of purines orpyrimidines (WO 97/33551 lm) 

Aptamers are short oligonucleotide se,ue„ces that can be ns'ed to recognize and 
raybindahnostanyn^e. T„e sys^c evoiution ofhgar.ds by 

MO.TuerkandGo.d, >^> * PowerMac be used to Ms ™» aptamers 
Apumershaven.nydiagnostieandcunica, uses; aimos, any use in which an antibody 

diagnostic tests (Jayasena, 1999). 

nefa ™ dm ^P^Absa^antil»d,fag 01 en te ,suchasF. l or(P^ ta 
bmdnnmunospecifcallytoanyAAPepitopes. ' 

"Antibody (Ab) comprises single Aba directed against an AAP (anti-AAP Ab- 
— antibod^isobWned^apopnlationofsn bstanWyhon.oge.eons Abs 
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i.e., the ^dividual Abs comprising the population are identical except for possible 
naturally-occurring mutations that may be present in minor amounts. Exemplary Abs 
uiclude polyclonal (pAb), monoclonal (mAb), humanized, bi-specific (bsAb), and 
heteroconjugate Abs. 

1 • Polyclonal Abs (pAbs) 

Polyclonal Abs can be raised in a mammaUan host, for example, by one or more 
mjecuons of an immunogen and, if desired, an adjuvant Typically, the immunogen 
and/or adjuvant are injected in the mammal by multiple subcutaneous or intraperitoneal 
mjectons. The immunogen may include an AAP or a fusion protein. Examplesof 
adjuvants -lude Freund's complete and monophosphoryl Lipid A synme^^^ 
^corynomycolateCMPL-TDM). To improve the immune response, an immunogen may 
be conjugated to a protein that is immunogenic in the host, such as keyhole limpet 

hemocyanin (KLH), serU m albumin, bovine ^-^1^ 
. Protocols forantibodyproductionaredescribedbyCAusubele^/., 1987; Harlow and 

Lane, 1988). Alternatively, pAbs may be made in chickens, producing IgY molecules 

(Schadeef a/., 1996). 

2. Monoclonal Abs (mAbs) 

Anti-AAP mAbs may be prepared using hybridoma methods (Milstein and Cuello 
1983). Hybndoma methods comprise at least four steps: (1) immunizing a host or 
lymphocytes from a host; (2) harvesting the mAb secreting (or potentially secreting) 
lymphocytes, (3) fusing the lymphocytes to immortalized cells, and (4) selecting those 
cells that secrete the desired (anti-AAP) mAb. 

A mouse, rat, guinea pig, hamster, or other appropriate host is immunized to elicit 
lymphocytes that produce or are capable of producing Abs that will specifically bind to 
thennmunogea Alternatively, the lymphocytes may be immunized /„ vitro. If human 
cells are desired, peripheral blood lymphocytes (PBLs) are generally used; however 
spleen cells or lymphocytes from other mammalian sources are preferred The 
immunogen typically includes an AAP or a fusion protein. 

The lymphocytes are then fused with an immortalized cell line to form hybridoma 
cells, faahtated by a fusing agent such as polyethylene glycol (Coding, 1996) Rodent, 
bovme, or human myeloma cells immortalized by transformation may be used, or rat or 
mouse myeloma cell line, Because pure populations of hybridoma cells and not unfused 
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immortalized cells are preferred the celk after fi,ci™ 

v n-ea, me cells alter fusion are grown in a suitable medium that 

contams one or more substances that inhibit the growth or survival of unfused, 
memorialized cell, A common technique uses parental cells that lack the enzyme 
hypoxanthme guanine phosphoribosyl transferase (HGPRT or HPRT) In this case 
hypoxanthine, aminopterin and thymidine are added to the medium (HAT medium)' to 
prevent megrowmofHGPRT-deficientceUswMlepermitdn^ 

Preferred immortalized cells fuse efficiently, can be isolated from mixed 
populationsby selecting inamedium such as HAT, and support stable and high-level 
expression of antihody after fusion. Preferred immortalized cell lines are murine 
myelomalines, available from the American Type QUture Collection (Manassas, VA) 
Humanmyelomaandmouse-humanheteromyelomacell lines also have been described 
for the production ofhuman mAbs (Kozborrf al, 1984;Schook, 1987) 

Because hybridoma cells secrete antibody extracellularly, the culture media can be 
assayed for the presence of mAbs directed against an AAP (anti-AAP mAbs) 
Immunoprecipitation or *, vitro binding assays, such as radio immunoassay (RIA) or 
enzyme-linked immunoabsorbent assay (ELISA), measure the binding specificity of 
^(HarlowandW, 1988; Harlow and I^e, 1999), including Scatchard analysis 
(Munson and Rodbard, 1 980). 

^•^^^^ghybridomacelkmaybei.ola.edas^eclonesby 
taHtrng dilution P»»*». and sub-cuitored (Godillg) ^ ^ ^ 

■nclude DulbeWs Modified Eagle's Medium, RPMI-1640, or if desired, a protein-free 
or -reduced or serum-free medium UUra DOMAPForHL-I; Biowhi«aker 
Waltavine.MD). The hybridoma cells may atao be ^ „ ^ „ ^ ' 

The mAbs may be isotated or purified from me culture medium or ascites fluid by 
— ^Ilgpunacau-ooprocadures such a, protein A-Sepha^e, hydroxyiapante 
chromatography, ge. electrophoresis, dialysis, amm„ mmnOTll a teprec , pitattonM 
eh ro malography(Harlowa n dLane,1988-HarIowandLane,loo 9) . 

™ e ™y be made by recombinant methods (U.S. Paten, No. 4166452 
1979). WAencodfaga.ti-AAPmAbscar.tereadilyisolatedandsequencedusm. ' 
conventional procedures, e. g ., using „Ii g0 uucleo.ide probes mat specificaliy bind to 
munn. heavy and Ugh, antibody chain genes, to prob e preferably DNA isolated from 
anh-AAP-secreting mAb hybridoma cell lines. Once isolated, the isolated DNA 
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hgments are sub-cloned int0 exprcKion ^ ^ m ^ ^ 

no,om™ S ep roduC eI gpn)t e^ toexp ^^ raeisoWedDNAiiagme]itsranbe 
modtfled, for example, by substituting me ^ fc f ^ ^ 

4 656, ^.>^Moni S oa«„ ; ., I987)>MbyftsingtheIgMdingsequMoetoaUo 

stated fortacon.tan.do^of^^^^^^^^^^ 

domams of on. antigen-combining site to create a chimeric bivalent antibody. 

3. Monovalent Abs 

^ A ^^ytemo n ovalen t Ab s tta,co nS e q „e M1 y (lonolcross . Iil]kl¥i[lleacli 
l!!"" P ' e ' 0 " m ^ faVOlVKrramb ^ ,e ^ M<, »S'*^and 

^tituted^ano.heran^oacid^dueorarede.eW.preventir^^nrtogA 
— methods are ^ ^ fcr prcpaii]g ^ ^ ^ ^ 

P^cefa^.ucbaaF^^^^^,^^^^ 

4- Humanized and human Abs 

^- A APAb S mayrarthercorr, 1 ri S eh m r^< )rhurnanAbsL HlImmized 
form, of non-human Abs „ chimeiic ^ , g ^ ^ ^ ^ 

F„ orotberantigen-bfadmg Sequences of Abs) that contain minimal seouenLe ' 

derived from non-human Ig. 

Generally, a humanized antibody Has one or more ammo acid residues introdnced 

import restdues, »hich are typicaliy taken from an "import- ■ variable domain. 
Homamzation is accomphshed by substituting rodent CDRs or CDR sequences for me 
conesponding seouencea of a human antibody (Jones m 6 ; Riechmanu « „; 

^trt*"". ,988) ' "'*~^««*«**.C<« .Patents 
4816567, m 9) , where m snbaailtialJy ^ ^ 

^*^°yfce correspond^ 
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FR res, u es a. subs^ted by raiduK fom ^ k 

Abs include human Igs (recipient antibody) in which residues from a complementary^ 

d^ regio n(C D R,ofme rc cipien,arerepW byI e si d U e S& omaCI,RofT„on- 
h^spec.esCdonoran^suchasmouse.tatot^havingthedesired 

« CTT* '"^^-—--—residues 
fences genet* me iu^nized antibody comprises suostatMyanofatieastone 
^.moseofanon-hun^Iaandmostifnot^ormeKR^onsaremoseofa 

resta, 1992; Riechmann e/ a/., 1988). 

""scandsobeprcducedu^g^ 
^^<H^bo„m^, 199I:Marksai , M991)andth » * 

^otAbsCBoetner^,^,^^^,^ 

hu™^ m^sgenicammais in which me endogenous Ig genes have bel . 

humans ,n a,, respec*. inCuding gene rearrangement, assemmy, and anu*ody repertoire 

5569825, 1996; U.S. Patent No. 5633425 1997- TT<3 p ♦ ♦ t.t ^ 

°' iyy7 » u s - Patent No. 5661016 1997- TT <5 

5. Bi-specific niAbs 

^AAP.meomertsforanyanhgenofcho^preferabiyaceU^pre^o, 
receptor or receptor subunit 

MtionaUy, the recombinant production of bi-specific Abs is based on the co- 
^erentspecficines Ostein and Cueiio, ,o 83 , Because of me random assoKmeo, o, 
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tectaqnes (WO 93/08829, 1993;Traunecker aiI f., 1991). 

of&e hinge, CH2, and CH3 region, ^.y, to| ^ 

organism. 

«U*re0VO96/270., ,996,. ^^tefc..^^ 

ynof „, ^bodycon^don* m oneMmoresma „ ^ 

Co m pe nsatMy ..cav ili e s »of aaiti ca,o rstaillrsfee 
*pUcmg targeam.no acid side chate^^e, ones ^ „ ^ 

Z!Zl™' heyieldof,heheterodtaerover — 

«< 1 9 85 ).F ragm ^a r e r ed m ed wiUl ad iW „l c „ mpto gage nt , suc ba s ^ iun , 

of 4e Fat-TNB denvauves fa to reconverted «o He F^o, by ^ 
mercaptoemytamine and is mixed wim an euaimolar amoun, of the other F^TNB 
derive ,o to .be bi-specinc anobody. * „ ^ m _ 

agents for the selechve immobilization of enzymes. ' 
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F ab - fragments may be directly recovered from R coli and chemically coupled to 
form bi-specific Abs. For example, fully humanized bi-specific F (ab . )2 Abs can be 
produced (Shalaby et al, 1992). Each F ab - fragment is separately secreted from E. coli 
and directly coupled chemically in vitro, forming the bi-specific antibody. 

Various techniques for making and isolating bi-specific antibody fragments 
directly from recombinant cell culture have also been described. For example, leucine 
zipper motifs can be exploited (Kostelny et al, 1992). Peptides from the Fos and Jun 
proteins are linked to the F ab . portions of two different Abs by gene fosioa The antibody 
homodimers are reduced at the hinge region to form monomers and then re-oxidized to 
form antibody heterodimers. This method can also produce antibody homodimers. The 
"diabody" technology (Holliger et al, 1993) provides an alternative method to generate 
bi-specific antibody fragments. The fragments comprise a heavy-chain variable domain 
(V H ) connected to a light-chain variable domain (V L ) by a linker that is too short to allow 
pairing between the two domains on the same chain. The V H and V L domains of one 
fragment are forced to pair with the complementary V L and V„ domains of another 
fragment, forming two antigen-binding sites. Another strategy for making bi-specific 
antibody fragments is the use of single-chain F v (sF v ) dimers (Gruber et al, 1994). Abs 
with more than two valencies are also contemplated, such as tri-specific Abs (Tutt et al 
1991). 

Exemplary bi-specific Abs may bind to two different epitopes on a given AAP 
Alternatively, cellular defense mechanisms can be restricted to a particular cell expressing 
the particular AAP: an anti-AAP arm may be combined with an arm that binds to a 
leukocyte triggering molecule, such as a T-cell receptor molecule (e.g. CD2, CD3, CD28, 
or B7), or to F c receptors for IgG (F c yR), such as F c yRI (CD64), F c yRII (CD32) and 
FcyRIII (CD16). Bi-specific Abs may also be used to target cytotoxic agents to cells that 
express a particular AAP. These Abs possess an AAP-binding arm and an arm that binds 
a cytotoxic agent or a radionuclide chelator. 
6. Heteroconjugate Abs 

Heteroconjugate Abs, consisting of two covalently joined Abs, have been 
proposed to target immune system cells to unwanted cells (4,676,980, 1987) and for 
treatment of human immunodeficiency virus (HIV) infection (WO 91/00360, 1991; WO 
92/20373, 1992). Abs prepared in vitro using synthetic protein chemistry methods,' 
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including those involving cross-linking agents, are contemplated For example, 
immunotoxins may be constructed using a disulfide exchange reaction or by forming a 
thioether bond. Examples of suitable reagents include irninothiolate and methyl-4- 
mercaptobutyrirnidate (4,676,980, 1987). 
7. Immunoconjugates 

Immunoconjugates may comprise an antibody conjugated to a cytotoxic agent 
such as a chemotherapeutic agent, toxin {eg., an enzymatically active toxin or fragment 
of bacterial, fungal, plant, or animal origin), or a radioactive isotope (/.<?., a 
radioconjugate). 

Useful enzymatically-active toxins and fragments include Diphtheria A chain, 
non-binding active fragments of Diphtheria toxin, exotoxin A chain from Pseudomonas 
aeruginosa, ricin A chain, abrin A chain, modeccin A chain, a-sarcin, Aleurites fordii 
proteins, Dianthin proteins, Phytolaca americana proteins, Momordica charantia 
inhibitor, curcin, crotin, Sapaonaria officinalis inhibitor, gelonin, mitogellin, restrictocin, 
phenomycin, enomycin, and the tricothecenes. A variety of radionuclides are available 
for the production of radioconjugated Abs, such as 212 Bi, 131 1, 131 Ih, 90 Y, and ,86 Re. 

Conjugates of the antibody and cytotoxic agent are made using a variety of bi- 
functional protein-coupling agents, such as N-succinimidyl-3-(2-pyridyldithiol) 
propionate (SPDP), iminothiolane (IT), bi-functional derivatives of imidoesters (such as 
dimethyl adipimidate HC1), active esters (such as disuccinimidyl suberate), aldehydes 
(such as glutareldehyde), 6w-azido compounds (such as bis (p-azidobenzoyl) 
hexanediamine), Ws-diazonium derivatives (such as bis-(p-diazoniumbenzoyl)- 
emylenediamine), diisocyanates (such as tolyene 2,6- diisocyanate), and to-active 
fluorine compounds (such as l,5-<nfluorc~2,4-dinitrobenzene). For example, a ricin 
immunotoxin can be prepared (Vitetta et al., 1987). I4 C-labeled 1-isothiocyanatobenzyl- 
3-methyldiethyIene triaminepentaacetic acid (MX-DTPA) is an exemplary chelating 
agent for conjugating radionuclide to antibody (WO 94/1 1026, 1994). 

In another embodiment, the antibody may be conjugated to a "receptor" (such as 
streptavidin) for utilization in tumor pre-targeting wherein the antibody-receptor 
conjugate is administered to the patient, followed by removal of unbound conjugate from 
the circulation using a clearing agent and then administration of a streptavidin "ligand" 
(e.g., biotin) that is conjugated to a cytotoxic agent (e.g., a radionuclide). 
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8. Effector function engineering 

The antibody can be modified to enhance its effectiveness in treating a disease, 
such as cancer. For example, cysteine residue(s) may be introduced into the F c region, 
thereby allowing interchain disulfide bond formation in this region. Such homodimeric 
Abs may have improved internalization capability and/or increased complement-mediated 
cell killing and antibody-dependent cellular cytotoxicity (ADCC) (Caron et aL, 1992; 
Shopes, 1992). Homodimeric Abs with enhanced anti-tumor activity can be prepared 
using hetero-bifiinctional cross-linkers (Wolff aL, 1993). Alternatively, an antibody 
engineered with dual F c regions may have enhanced complement lysis (Stevenson et aL, 
1989). 

9. Immunoliposomes 

Liposomes containing the antibody may also be formulated (U.S. Patent No. 
4485045, 1984; U.S. Patent No. 4544545, 1985; U.S. Patent No. 5013556, 1991; Eppstein 
et aL, 1985; Hwang et aL, 1980). Useful liposomes can be generated by a reverse-phase 
evaporation method with a lipid composition comprising phosphatidylcholine, 
cholesterol, and PEG-derivatized phosphatidylethanolamine (PEG- PE). Such 
preparations are extruded through filters of defined pore size to yield liposomes with a 
desired diameter. Fab' fragments of the antibody can be conjugated to the liposomes 
(Martin and Papahadjopoulos, 1982) via a disulfide-interchange reaction A 
chemotherapeutic agent, such as Doxorubicin, may also be contained in the liposome 
(Gabizon et aL, 1989). Other useful liposomes with different compositions are 
contemplated. 

1 0. Diagnostic applications of Abs directed against an AAP 

Anti-AAP Abs can be used to localize and/or quantitate anAAP (e.g, for use in 
measuring levels of an AAP within tissue samples or for use in diagnostic methods, eta). 
Anti-AAP epitope Abs can be utilized as pharmacologically-active compounds. 

Anti-AAP Abs can be used to isolate an AAP by standard techniques, such as 
immunoafiFinity chromatography or immunoprecipitation These approaches facilitate 
purifying an endogenous AAP antigen-containing polypeptides from cells and tissues. 
These approaches, as well as others, can be used to detect an AAP in a sample to evaluate 
the abundance and pattern of expression of the antigenic protein. Anti-AAP Abs can be 
used to monitor protein levels in tissues as part of a clinical testing procedure; for 
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example, to determine the efficacy of a given treatment regimen Coupling the antibody 
to a detectable substance (label) allows detection of Ab-antigen complexes. Classes of 
labels include fluorescent, luminescent, bioluminescent, and radioactive materials, 
enzymes and prosthetic groups. Useful labels include horseradish peroxidase, alkaline 
phosphatase, (3-galactosidase, acetylcholinesterase, streptavidin/biotin, avidin/biotin, 
umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine 
fluorescein, dansyl chloride, phycoerythrin, luminol, luciferase, luciferin, aequorin, and 

1 1 . Antibody therapeutics 

Abs of the invention, including polyclonal, monoclonal; humanized and fully 
human Abs, can be used therapeutically. Such agents will generally be employed to treat 
or prevent a disease or pathology in a subject An antibody preparation, preferably one 
having high antigen specificity and affinity generally mediates an effect by binding the 
target epitope(s). Generally, administration of such Abs may mediate one of two effects: 
(1) the antibody may prevent ligand binding, eliminating endogenous ligand binding and 
subsequent signal transduction, or (2) the antibody elicits a physiological result by 
binding an effector site on the target molecule, initiating signal transduction. 

A therapeutically effective amount of an antibody relates generally to the amount 
needed to achieve a therapeutic objective, epitope binding affinity, administration rate, 
and depletion rate of the antibody from a subject Common ranges for therapeutically 
effective doses may be, as a nonlimiting example, from about 0.1 mg/kg body weight to 
about 50 mg/kg body weight Dosing frequencies may range, for example, from twice 
daily to once a week. 

12. Pharmaceutical compositions of Abs 

Anti-AAP Abs, as well as other AAP interacting molecules (such as aptamers) 
identified in other assays, can be administered in pharmaceutical compositions to treat 
various disorders. Principles and considerations involved in preparing such compositions, 
as well as guidance in the choice of components can be found in (de Boer, 1994; 
Gennaro, 2000; Lee, 1990). 

Since some AAP are intracellular, Abs that are internalized are preferred used 
when whole Abs are used as inhibitors. Liposomes may also be used as a delivery 
vehicle for intracellular introduction Where antibody fragments are used, the smallest 
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inhibitory fragment that specifically binds to the epitope is preferred. For example 
peptide molecules can be designed that bind a preferred epitope based on the variable- 
region sequences of a useful antibody. Such peptides can be synthesized chemically 
and/or produced by recombinant DNA technology (Marasco * al, 1993). Formulations 
may also contain more than one active compound for a particular treatment, preferably 
those with activities that do not adversely affect each other. The composition may 
comprise an agent that enhances function, such as a cytotoxic agent, cytokine, 
chemotherapeutic agent, or growth-inhibitory agent. 

The active ingredients can also be entrapped in microcapsules prepared by 
coacervation techniques or by interfacial polymerization; for example, 
hydroxymethylceUulose or gelatm-microcapsules and poly-(methylmethacrylate) 
rmcrocapsules, respectively, in colloidal drug delivery systems (for example, liposomes 
albumin microspheres, microemulsions, nano-particles, and nanocapsules) or in 
macroemulsions. 

The formulations to be used for in vivo administration are highly preferred to be 
stenle. This is readily accomplished by filtration through sterile filtration membranes or 
any of a number of techniques. 

Sustained-release preparations may also be prepared, such as semi-permeable 
matrices of solid hydrophobic polymers containing the antibody, which matrices are in 
the form of shaped articles, e.g., films, or microcapsules. Examples of sustained-release 
matrices include polyesters, hydrogels (for example, poly(2-hydroxyethyl-methacryiate) 
or poly(vmylaIcohol)), polylactides (Boswell and Scribner, U.S. Patent No. 3,773 919 ' 
1973), copolymers of ^glutamic acid and y ethyl-L-glutamate, non-degradab'le etiiylene- 
vmyl acetate, degradable lactic acid-glycolic acid copolymers such as injectable 
microspheres composed of lactic acid-glycolic acid copolymer, and poly-D-(->3- 
hydroxybutyric acid. While polymers such as ethylene-vinyl acetate and lactic acid- 
glycolic acid enable release of molecules for over 100 days, certain hydrogels release 
proteins for shorter time periods and may be preferred. 



AAP recombinant expression vectors and host cells 

Vectors are tools used to shuttle DNA between host cells or as a means to express 
a nucleotide sequence. Some vectors function only in prokaryotes, while others function 
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in both prokaryotes and eukaryotes, enabling large-scale DNA preparation from 
prokaryotes for expression in eukaryotes. Inserting the DNA of interest, such as an AAP 
nucleotide sequence or a fragment, is accomplished by ligation techniques and/or mating 
protocols well-known to the skilled artisan. Such DNA is inserted such that its 
integration does not disrupt any necessary components of the vector. In the case of 
vectors that are used to express the inserted DNA protein, the introduced DNA is 
operably-linked to the vector elements that govern its transcription and translation. 

Vectors can be divided into two general classes: Cloning vectors are replicating 
plasmid or phage with regions that are non-essential for propagation in an appropriate 
host cell, and into which foreign DNA can be inserted; the foreign DNA is replicated and 
propagated as if it were a component of the vector. An expression vector (such as a 
plasmid, yeast, or animal virus genome) is used to introduce foreign genetic material into 
a host cell or tissue in order to transcribe and translate the foreign DNA In expression 
vectors, the introduced DNA is operably-linked to elements, such as promoters, that 
signal to the host cell to transcribe the inserted DNA Some promoters are exceptionally 
useful, such as inducible promoters that control gene transcription in response to specific 
factors. Operably-linking an AAP or anti-sense construct to an inducible promoter can 
control the expression of an AAP or fragments, or anti-sense constructs. Examples of 
classic inducible promoters include those that are responsive to a-interferon, heat-shock, 
heavy metal ions, and steroids such as glucocorticoids (Kaufman, 1990) and tetracycline. 
Other desirable inducible promoters include those that are not endogenous to the cells in 
which the construct is being introduced, but, however, is responsive in those cells when 
the induction agent is exogenously supplied. 

Vectors have many difference manifestations. A "plasmid" is a circular double 
stranded DNA molecule into which additional DNA segments can be introduced. Viral 
vectors can accept additional DNA segments into the viral genome. Certain vectors are 
capable of autonomous replication in a host cell (e.g., bacterial vectors having a bacterial 
origin of replication and episomal mammalian vectors). Other vectors {e.g. , 
non-episomal mammalian vectors) are integrated into the genome of a host cell upon 
introduction into the host cell, and thereby are replicated along with the host genome In 
general, useful expression vectors are often plasmids. However, other forms of 
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expression vectors, such as viral vectors (e.g., replication defective retroviruses, 
adenoviruses and adeno-associated viruses) are contemplated 

Recombinant expression vectors that comprise an AAP (or fragments) regulate an 
AAP transcription by exploiting one or more host cell-responsive (or that can be 
manipulated in vitro) regulatory sequences that is operably-linked to an AAP. "Operably- 
linked" indicates that a nucleotide sequence of interest is linked to regulatory sequences 
such that expression of the nucleotide sequence is achieved. 

Vectors can be introduced in a variety of organisms and/or cells (Table D). 
Alternatively, the vectors can be transcribed and translated in vitro, for example using T7 
promoter regulatory sequences and T7 polymerase. 



Table D Examples of hosts for cloning or expression 



Organisms 


Examples 


Sources and References* 


Prokaryotes 


E. coli 

K 12 strain MM294 

X1776 

W3110 

K5 772 


ATCC 31,446 
ATCC 31,537 
ATCC 27,325 
ATCC 53,635 


Enterobacteriaceae 


Enterobacter 




Erwinia 




Klebsiella 




Proteus 




Salmonella (S. tyhpimurium) 




Serratia (S. marcescans) 




Shigella 




Bacilli (B. subtilis and B. 
licheniformis) 




Pseudomonas (P. aeruginosa) 




Streptomyces 




Eukaryotes 


Saccharomyces cerevisiae 




Yeasts 


Schizosaccharomyces pombe 




Kluyveromyces 
K lactis MW98-8C, 
CBS683, CBS4574 
K.fragilis 
K. bulgaricus 
K. wickeramii 


(Fleer et a/., 1991) 

(de Louvencourt et aL, 1983) 

ATCC 12,424 
ATCC 16,045 
ATCC 24,178 
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Table D Examples of hosts for cloning or expression 



Organisms 



Filamentous Fungi 



Invertebrate cells 



Vertebrate cells 



Examples 



K. waltii -.j 
K. drosophilarum 
K. thermotolerans 
K. marxianus; yarrowia 



Pichia pastoris 



Candida 



Trichoderma reesia 



Neurospora crassa 



Torulopsis 



Rhodotorula 



Schwanniomyces (5. 
occidentalis) 



Neurospora 



Penicillium 



Tolyyocladium 



Aspergillus {A. nidulans and 
A. niger) 
Drosophila S2 



Svodoptera Sf9 



Chinese Hamster Ovary 
(CHO) 



simian COS 
COS-7 



IHEK293 i_ — 

•Unreferenced cells are generally available from American Type Culture 
(Manassas, VA). . . — . 

Vector choice is dictated by the organism or cells'being used and the desired fate 
of the vector. Vectors may replicate once in the target cells, or maybe "suicide" vectors. 
In general, vectors comprise signal sequences, origins of replication, marker genes, 
enhancer elements, promoters, and transcription termination sequences. The choice of 
these elements depends on the organisms in which the vector will be used and are easily 
. determined. Some of these elements may be conditional, such as an inducible or 
conditional promoter that is turned "on" when conditions are appropriate. Examples of 
inducible promoters include those that are tissue-specific, which relegate expression to 
certain cell types, steroid-responsive, or heat-shock reactive. Some bacterial repression 
systems, such as the lac operon, have been exploited in mammalian cells and transgenic 
animals (Fieck et al., 1992; Wyborski et al, 1996; Wyborski and Short, 1991). Vectors 



Sources and References* 



ATCC 56,500 
ATCC 36,906 

(EPO 402226, 1990) 



(Sreekrishna et al, 1988) 



(Case e* a/., 1979) 



fWQ 91/00357, 1991) 

(Kelly and Hynes, 1985; Tilburn 
et aL 1983; Yelton et al., 1984) 



ATCC CRL 1651 



Collection 
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often use a selectable marker to facilitate identifying those cells that have incorporated 
the vector. Many selectable markers are well known in the art for the use with 
prokaryotes, usually antibiotic-resistance genes or the use of autotrophy and auxotrophy 
mutants. 

Using antisense and sense AAP oligonucleotides can prevent an AAP polypeptide 
expression These oligonucleotides bind to target nucleic acid sequences, forming 
duplexes that block transcription or translation of the target sequence by enhancing 
degradation of the duplexes, tenninating prematurely transcription or translation, or by 
other means. 

Antisense or sense oligonucleotides are singe-stranded nucleic acids, either RNA 
or DNA, which can bind a target AAP mRNA (sense) or an AAP DNA (antisense) 
sequences. According to the present invention, antisense or sense oligonucleotides 
comprise a fragment of an AAP DNA coding region of at least about 14 nucleotides, 
preferably from about 14 to 30 nucleotides. In general, antisense RNA or DNA 
molecules can comprise at least 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 
80, 85, 90, 95, 100 bases in length or more. Among others, (Stein and Cohen, 1988; van 
der Krol et al, 1988a) describe methods to derive antisense or a sense oligonucleotides 
from a given cDNA sequence. 

Modifications of antisense and sense oligonucleotides can augment their 
effectiveness. Modified sugar-phosphodiester bonds or other sugar linkages (WO 
91/06629, 1991), increase in vivo stability by conferring resistance to endogenous 
nucleases without disrupting binding specificity to target sequences. Other modifications 
can increase the affinities of the oligonucleotides for their targets, such as covalently 
linked organic moieties (WO 90/10448, 1990) or poly-(L)-lysine. Other attachments 
modify binding specificities of the oligonucleotides for their targets, including metal 
complexes or intercalating (e.g. ellipticine) and alkylating agents. 

To introduce antisense or sense oligonucleotides into target cells (cells containing 
the target nucleic acid sequence), any gene transfer method may be used and are well 
known to those of skill in the art. Examples of gene transfer methods include 1) 
biological, such as gene transfer vectors like Epstein-Barr virus or conjugating the 
exogenous DNA to a hgand-binding molecule (WO 91/04753, 1991), 2) physical, such as 
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electroporation, and 3) chemical, such as CaP0 4 precipitation and ohgonucleotide-lipid 
complexes (WO 90/10448, 1990). 

The terms "host cell" and "recombinant host cell" are used interchangeably. Such 
terms refer not only to a particular subject cell but also to the progeny or potential 
progeny of such a cell Because certain modifications may occur in succeeding 
generations due to either mutation or environmental influences, such progeny may not, in 
fact, be identical to the parent cell, but are still included within the scope of the term 

Methods of eukaryotic cell transfection and prokaryotic cell transformation are 
well known in the art The choice of host cell will dictate the preferred technique for 
introducing the nucleic acid of interest Table E, which is not meant to be limiting, 
summarizes many of the known techniques in the art Introduction of nucleic acids^ into 
an organism may also be done with ex vivo techniques that use an in vitro method of 
transfection, as well as established genetic techniques, if any, for that particular organism. 

Table E Methods to introduce nucleic acid into cells 



Cells 


Methods 






Prokaryotes 
(bacteria) 


Calcium chloride 


References 
(Cohens a/., 1972; 
Hanahan, 1983; Mandel and 
Higa, 1970) 


Notes 


Electroporation 


(Shigekawa and Dower, 
1988) 




Eukaryotes 

Mammalian 
cells 


Calcium phosphate 
transfection 


iV-(2- 

Hydroxye%l)piperazine-iV'- 
(2-ethanesulfonic acid 
(HEPES) buffered saline 
solution (Chen and 
Okayama, 1988; Graham and 
van der Eb, 1973; Wigler et 
aU 1978) 

BES W^-bis(2- 
hydroxyethyl)-2- 
aminoethanesulfonic acid) 
buffered solution (Ishiura et 
aL t 1982) 


Cells may be 
"shocked" with 
glycerol or 
dimethykulfoxide 
(DMSO) to increase 
transfection 
efficiency (Ausubel 
etal, 1987). 
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E Methods to introduce nucleic acid into cells 



Cells 


Methods 

Diethylaminoethyl 

(DEAE)-Dextran 

transfection 


References 1 ^„ 

Most useful for 

(Fujita et al, 1986; Lopata et transie nt» but not 
al, 1984; Selden et al, stable > fransfections. 
1986) ' J Chloroquine can be i 

I used to increase 
-jrz — . . 1 efficiency. 


Electroporation 


(Weumanne^a/., 1982; 

Potter, 1988; Potter et al, Especially useful for 

1984; Wong and Neumann, ^-to-transfect 
1982) lymphocytes. 


Cationic lipid 

reagent 

transfection 

Retroviral 

j 

Polybrene 1 


^lroy-Stein and Moss, 1990; Ap^abiilo^otoH 

al.,1991:Whittet„; 7oo^J^^ fi|||i 

Production exemplified by 1 : 

(Cepko et al, 1984; Miller 

and Buttimnrp i oc<- tj«„„ . t 

Xl^T Lengthyprocess, 

Mection in vitro and in vivo: ffiSSS 
(Austin and Cepko, 1990; ATCC. Applicable 
fn^r^o" 1 ^ 6 ^ 6 tobothz^Voand 

1990; Williams etaL 1984^ 

;unaney^a/.,l986;Kawai | 

md Nishizawa, 1984) 


Can be used to 

Microinjection (Capecchi 1980) establish cell lines 
v i^ni, carrying integrated 
J | copies of AAP DNA | 
h= sequences. 




iw i • IKassoulzadegan et al, 1982: " 

Protoplast fusion Sandri-Goldine/a/., 1981; 
Schaffher, 1980) 


Insect cells ] 
(in vitro) 5 

I 


Useful lor in vitro 
Baculovirus (Luckow, 1991; Miller production of 
systems i 988; O'Reilly et al, 1992) Proteins with 
J eukaryotic 


Yeast 1 


jthium acetate ( Gietz * a/ -> 1998; Ito et al, 

1983) 


S 


pheroplast fusion | lm > Hinnen et al, | Laborious, can" ( 

' 1 produce aneuDloids 
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Cells 



TableE Methods to introduce nucleic acid into cells 



Plant cells 
(general 
reference: 
(Hansen and 
Wright, 
1999)) 



Agrobacterium 
transformation 

Biolistics 
(microprojectiles) 



Electroporation 
(protoplasts) 



Polyethylene 
glycol (PEG) 
treatment 



(BechtoldandPelletier, 
1998; Escudero and Hohn, 
1997; Hansen and Chilton! 
1999; Touraev and al 1 997 

and Chilton, 1999; Shillito, 

I et al, 1986; Rhodes et al, 
1988; Saunders etal, 1989) 
May be combined with 
liposomes (Trick and al 

1 1997) 



(Shillito, 1999) 



liposomes 



in planta 
microinjection 



May be combined with 

electroporation (Trick and 
LaUj997)_ 



(Leducandal., 1996; Zhou 
and ah, 1983^ 



Silicon carbide 
whiskers 



(Thompson and al., 1995) 



Notes 



TableF UsrfW selectable foreukayce 



cell transfection 



Selectable Marker 

Adenosine deaminase 
(ADA) 



Selection 



Media includes 9-p-D- 
xylofuranosyl adenine 
(Xyl-A) 



Action 

Conversion of XyfAto 
Xyl-ATP, which 
incorporates into 
SHElejcacidsJc^ 



Reference 



(Kaufman et 
al, 1986) 
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Table F Useful selectable markers for eukaryote cell transfection 



Selectable Marker 



Dihydrofolate 
reductase (DHFR) 



Aminoglycoside 
phosphotransferase 
("APHV'neo", 
"G418") 



Hygromycin-B- 

phosphotransferase 
(HPH) 



Thymidine kinase 
(HQ- 



Selection 



Action 



Methotrexate (MTX) 
and dialyzed serum 
(purine-free media) 



cells. ADA detoxifies 
MTX competitive 
inhibitor of DHFR. In 
absence of exogenous 
purines, cells require 
DHFR, a necessary 
enzyme in purine 
biosynthesis. 



G418 



G418, an 
! aminoglycoside 
I detoxified by APH, 

interferes with 
| ribosomal function and 
consequently, 
' translation 



Reference 



(Simonsen 
and 

Levinson, 
1983) 



hygromycin-B 



Forward selection 

(TK+): Media (HAT) 

incorporates 

' aminopterin. 

Reverse selection (TK- 

): Media incorporates 

5-bromodeoxyuridine 
(BrdU). 



Hygromycin-B, an 
aminocyclitol 
detoxified by HPH, 
disrupts protein 
translocation and 
promotes 
mistranslation. 



i 'Southern 
and Berg, 
1982) 



(Palmer et 
a/., 1987) 



Forward: Aminopterin 
forces cells to synthesze 
dTTP from thymidine, a 
pathway requiring TKL 
Reverse: TK 
phosphorylates BidU, 
which incorporates into 
nucleic acids, killing 
cells. 



(Littlefield, 
1964) 



A host cell of the invention, such as a prokaryotic or eukaxyotic host cell in 
culture, can be used to produce an AAP. Accordingly, the ir 
for producing an AAP using the host cells of the 



! invention provides methods 
invention. In one embodiment, the 



method comprises cutaring to host mU of fc ^ ^ ^ 

«~^-^-A» tab . tal-11 ^ llI . <lltab| , 

^..P^cedhanoto^taen,,^^^^^ 
AAP from the medium or the host cell. 
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Transgenic AAP animals 

Transgenic animal, ar, useful for ^ fc ^ 
- for . entttymg and/or evaluating module of ^ * 

-ludepnmates, sheep, dogs, cows, goa*. chickens, a^hibian, ac 
^a.deve.ops.^^r^^e^^^ ^ 

^onofreco^^ A "homologous recombinant animaT isa 

exogenous DNA molecule that recombines homologously with an endogenous AAP 
-be used to praI „ce non-human transgenic animals, such as ^ or 

recombinant animals. 

of f iT^ C ™' CmbeCKa,edbyin ' rod "™^^-t"".emalep I „„„ ctei 
of forced oocy* ^. by micoin^ „ ~ 

oc^g^an,^^^^^^^ fctronicseouencesand 

«*«— • ^'atorys^uencescanbeoperably-unltedtome^. 
^etodnectexpressionoftte^^art^^Me^dsforgeueraSng 

asrmce,havebec M econvend ra a,mmear t «.,. ( Hvau s « a ,.,us.Pa,ent N „ 
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4,870,009, ,989; Hogan, 0879693843, ,994; Leder and Stewart, US. P a tent No 
4,736,866, ,988; Wagner and Hoppe, US Patent No. 4,873,191,1989). Othernon-mice 

can be nsed to breed additional transgenic animals, can be identified based upon the 
presence of the transgene in its genome and/or expression of the transgene mRNA in 
fssues or ceUs of the animal, Transgenic animafc can be bred to otter tmagenic 
animals carrying other transgenes. 

2. Vectors for transgenic animal production 

° f -^°wfcchadeletion,ad^^ 

alter.e.^.disruptoraltertheexpressioaof.an^P. Andean be a murine gene or- 

vector functxonaUy disrupts an endogenous ^ geneuponhomoIogousrecomb 
and thus a non-functional AAP protein, if any, is expressed. 

Alternatively, the vector can be designed such that, upon homologous 
recombmation, an endogenous AAP is mutated or otherwise altered but still encodes 
luncrzonal protein (e.g., the upstream regulatory region can be altered to thereby alter the 
expression of an endogenous AAP). In this type of homologous recombination vector 
the^^ 

ofthe^Pto allow for homologous recombination to occur between the exogenous^" 
carnedbythe vector and an endogenous^ man embryonic stem cell The additional 
flankmg AAP nucleic acid is sufficient to engender homologous recombination with the 
endogenous AAP. Typically, several kilobases of flanking DNA (both at the 5'- and 3'- 
term 1 m)ar e mcludedmthevector(ThomasandCa P ecchi, 1987). The vector is then 
Produced into an embryonic stem cell line (,^., by electroporation), and cells in which 

3. Introduction of an AAP transgene cells during development 
Selected cells are then injected into a blastocyst of an animal (eg:, a mouse) to 
form aggregation chimeras (Bradley, 1 987). A chimeric embryo can then be implanted 
mto a suitable pff a and the embryo brought to term Progeny harboring the 
homologously-recombinedDNA in their germ cells can be used to breed animals in 
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which all cells of the animal contain the homologously-recombined DNA by germline 
transmission of the transgene. Methods for constructing homologous recombination 
vectors and homologous recombinant animals are described (Berns et al, WO 93/04169, 
1993; Bradley, 1991; Kucherlapati et al, WO 91/01 140, 1991; Le Mouellic and Brullet, 
WO 90/11354, 1990). 

Alternatively, transgenic animals that contain selected systems that allow for 
regulated expression of the transgene can be produced. An example of such a system is 
the cre/loxP recombinase system of bacteriophage PI (Lakso et al, 1992). Another 
recombinase system is the FLP recombinase system of Saccharomyces cerevisiae 
(O'Gorman et al, 1991). If a cre/loxP recombinase system is used to regulate expression 
of the transgene, animals containing transgenes encoding both the Cre recombinase and a 
selected protein are required. Such arumals can be produced as "double" transgenic 
animals, by mating an animal containing a transgene encoding a selected protein to 
another containing a transgene encoding a recombinase. 

Clones of transgenic animals can also be produced (Wilmut et al, 1 997). In brief, 
a cell from a transgenic animal can be isolated and induced to exit the growth cycle and 
enter Go phase. The quiescent cell can then be fused to an enucleated oocyte from an 
animal of the same species from which the quiescent cell is isolated. The reconstructed 
oocyte is then cultured to develop to a morula or blastocyte and then transferred to a pffa. 
The offspring borne of this female foster animal will be a clone of the "parent" transgenic 
animal. 

Pharmaceutical compositions 

The AAP nucleic acid molecules, AAP polypeptides, and anti-AAP Abs (active 
compounds) of the invention, and derivatives, fragments, analogs and homologs thereof, 
can be incorporated into pharmaceutical compositions. Such compositions typically 
comprise the nucleic acid molecule, protein, or antibody and a pharmaceutically 
acceptable carrier. A "pharmaceutical^ acceptable carrier" includes any and all solvents, 
dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption 
delaying agents, and the like, compatible with pharmaceutical adrninistration (Gennaro, 
2000). Preferred examples of such carriers or diluents include, but are not limited to, 
water, saline, finger's solutions, dextrose solution, and 5% human serum albumin. 
Liposomes and non-aqueous vehicles such as fixed oils may also be used Except when a 
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conventional media or agent is incompatible with an active compound, use of these 
compositions is contemplated. Supplementary active compounds can also be 
incorporated into the compositions. 

1 . General considerations 

A pharmaceutical composition of the invention is formulated to be compatible 
with its intended route of administration, including intravenous, intradermal, 
subcutaneous, oral (e.g., inhalation), transdermal (i.e., topical), transmucosal, and rectal 
administration. Solutions or suspensions used for parenteral, intradermal, or 
subcutaneous application can include: a sterile diluent such as water for injection, saline 
solution, fixed oils, polyethylene glycols, glycerine, propylene glycol or other synthetic 
solvents; antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants 
such as ascorbic acid or sodium bisulfite; chelating agents such as 
ethylenediaminetetraacetic acid (EDTA); buffers such as acetates, citrates or phosphates, 
and agents for the adjustment of tonicity such as sodium chloride or dextrose. The pH 
can be adjusted with acids or bases, such as hydrochloric acid or sodium hydroxide. The 
parenteral preparation can be enclosed in ampoules, disposable syringes or multiple dose 
vials made of glass or plastic. 

2. Injectable formulations 

Pharmaceutical compositions suitable for injection include sterile aqueous 
solutions (where water soluble) or dispersions and sterile powders for the 
extemporaneous preparation of sterile injectable solutions or dispersioa For intravenous 
administration, suitable carriers include physiological saline, bacteriostatic water, 
CREMOPHOR EL™ (BASF, Parsippany, NJ.) or phosphate buffered saline (PBS). In all 
cases, the composition must be sterile and should be fluid so as to be administered using a 
syringe. Such compositions should be stable during manufacture and storage and must be 
preserved against contamination from microorganisms such as bacteria and fungi. The 
carrier can be a solvent or dispersion medium containing, for example, water, ethanol, 
polyol (such as glycerol, propylene glycol, and liquid polyethylene glycol), and suitable 
mixtures. Proper fluidity can be maintained, for example, by using a coating such as 
lecithin, by maintaining the required particle size in the case of dispersion and by using 
surfactants. Various antibacterial and antifungal agents, for example, parabens, . 
chlorobutanol, phenol, ascorbic acid, and thimerosal, can contain microorganism 
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contaminatioa Isotonic agents, for example, sugars, polyalcohols such as manitol, 
sorbitol, and sodium chloride can be included in the composition. Compositions that can 
delay absorption include agents such as aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active compound 
an AAP or anti-AAP antibody) in the required amount in an appropriate solvent 
with one or a combination of ingredients as required, followed by sterilizatioa 
Generally, dispersions are prepared by incorporating the active compound into a sterile 
vehicle that contains a basic dispersion medium, and the other required ingredients as 
discussed Sterile powders for the preparation of sterile injectable solutions, methods of 
preparation include vacuum drying and freeze-drying that yield a powder containing the 
active ingredient and any desired ingredient from a sterile solutions. 

3. Oral compositions 

Oral compositions generally include an inert diluent or an edible carrier. They can 
be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients and 
used in the form of tablets, troches, or capsules. Oral compositions can also be prepared 
using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is 
applied orally. Phannaceutically compatible binding agents, and/or adjuvant materials 
can be included. Tablets, pills, capsules, troches and the like can contain any of the , 
following ingredients, or compounds of a similar nature: a binder such as microcrystalline 
cellulose, gum tragacanth or gelatin; an excipient such as starch or lactose, a 
disintegrating agent such as alginic acid, PRIMOGEL, or corn starch; a lubricant such as 
magnesium stearate or STEROTES; a glidant such as colloidal silicon dioxide; a . 
sweetening agent such as sucrose or saccharin; or a flavoring agent such as peppermint, 
methyl salicylate, or orange flavoring. 

4. Compositions for inhalation 

For administration by inhalation, the compounds are delivered as an aerosol spray 
from a nebulizer or a pressurized container that contains a suitable propellant, a gas 
such as carbon dioxide. 

5 . Systemic administration 

Systemic administration can also be transmucosal or transdermal. For 
transmucosal or transdermal administration, penetrants that can permeate the target 
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barrier(s) are selected Transmucosal penetrants include, detergents, bile salts, and 
fusidic acid derivatives. Nasal sprays or suppositories can be used for transmucosal 
administration. For transdermal administration, the active compounds are formulated into 
ointments, salves, gels, or creams. 

The compounds can also be prepared in the form of suppositories (e.g., with bases 
such as cocoa butter and other glycerides) or retention enemas for rectal delivery. 

6. Carriers 

In one embodiment, the active compounds are prepared with carriers that protect 
the compound against rapid elimination from the body, such as a controlled release 
formulation, including implants and microencapsulated delivery systems. Biodegradable, 
biocompatible polymers can be used, such as ethylene vinyl acetate, polyanhydrides, 
polyglycolic acid, collagen, polyorthoesters, and polylactic acid Such materials can be 
obtained commercially from ALZA Corporation (Mountain View, CA) and NOVA 
Pharmaceuticals, Inc. (Lake Elsinore, CA), or prepared by one of skill in the art 
Liposomal suspensions can also be used as pharmaceutical^ acceptable carriers. These 
can be prepared according to methods known to those skilled in the art, such as in 
(Eppstein et al, US Patent No. 4,522,81 1, 1985). 

7. Unit dosage 

Oral formulations or parenteral compositions in unit dosage form can be created to 
facilitate administration and dosage uniformity. Unit dosage form refers to physically 
discrete units suited as single dosages for the subject to be treated, containing a 
therapeutically effective quantity of active compound in association with the required 
pharmaceutical carrier. The specification for the unit dosage forms of the invention are 
dictated by, and directly dependent on, the unique characteristics of the active compound 
and the particular desired therapeutic effect, and the inherent limitations of compounding 
the active compound 

8. Gene therapy compositions 

The nucleic acid molecules of the invention can be inserted into vectors and used 
as gene therapy vectors. Gene therapy vectors can be delivered to a subject by, for 
example, intravenous injection, local administration (Nabel and Nabel, US Patent No. 
5,328,470, 1 994), or by stereotactic injection (Chen et al. , 1 994). The pharmaceutical 
preparation of a gene therapy vector can include an acceptable diluent, or can comprise a 
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slow release matrix in which the gene delivery vehicle is imbedded Alternatively, where 
the complete gene delivery vector can be produced intact from recombinant cells, e.g., 
retroviral vectors, the pharmaceutical preparation can include one or more cells that 
produce the gene delivery system. 

9. Dosage 

The pharmaceutical composition and method of the present invention may further 
comprise other therapeutically active compounds as noted herein that are usually applied' 
in the treatment of the above mentioned pathological conditions. 

In the treatment or prevention of conditions which require AAP modulation an 
appropriate dosage level will generally be about 0.01 to 500 mg per kg patient body 
weight per day which can be administered in single or multiple doses. Preferably, the 
dosage level will be about 0.1 to about 250 mg/kg per day; more preferably about 0.5 to 
about 100 mg/kg per day. A suitable dosage level may be about 0.01 to 250 mg/kg per 
day, about 0.05 to 100 mg/kg per day, or about 0.1 to 50 mg/kg per day. Within this 
range the dosage may be 0.05 to 0.5, 0.5 to 5 or 5 to 50 mg/kg per day. For oral 
administration, the compositions are preferably provided in the form of tablets containing 
1.0 to 1000 milligrams of the active ingredient, particularly 1.0, 5.0, 10.0, 15.0. 20.0, 
25.0, 50.0, 75.0, 100.0, 150.0, 200.0, 250.0, 300.0, 400.0, 500.0, 600.0, 750.0, 800.0, 
900.0, and 1000.0 milligrams of the active ingredient for the symptomatic adjustment of 
the dosage to the patient to be treated. The compounds may be administered on a 
regimen of 1 to 4 times per day, preferably once or twice per day. 

It will be understood, however, that the specific dose level and frequency of 
dosage for any particular patient may be varied and will depend upon a variety of factors 
including the activity of the specific compound employed, the metabolic stability and 
length of action of that compound, the age, body weight, general health, sex, diet, mode 
and time of administration, rate of excretion, drug combination, the severity of the 
particular condition, and the host undergoing therapy. 

1 0. Kits for pharmaceutical compositions 

The pharmaceutical compositions can be included in a kit, container, pack, or 
dispenser together with instructions for administration. When the invention is supplied as 
a kit, the different components of the composition may be packaged in separate containers 
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and admixed immediately before use. Such packaging of the components separately may 
permit long-term storage without losing the active components' functions. 

Kits may also include reagents in separate containers that facilitate the execution 
of a specific test, such as diagnostic tests or tissue typing. For example, AAP DNA 
templates and suitable primers may be supplied for internal controls. 

(a) Containers or vessels 

The reagents included in the kits can be supplied in containers of any sort such 
that the life of the different components are preserved, and are not adsorbed or altered by 
the materials of the container. For example, sealed glass ampules may contain 
lyophilized luciferase or buffer that have been packaged under a neutral, non-reacting gas, 
such as nitrogen. Ampoules may consist of any suitable material, such as glass, organic 
polymers, such as polycarbonate, polystyrene, etc., ceramic, metal or any other material 
typically employed to hold reagents. Other examples of suitable containers include 
simple bottles that may be fabricated from similar substances as ampules, and envelopes, 
that may consist of foil-lined interiors, such as aluminum or an alloy. Other containers 
include test tubes, vials, flasks, bottles, syringes, or the like. Containers may have a 
sterile access port, such as a bottle having a stopper that can be pierced by a hypodermic 
injection needle. Other containers may have two compartments that are separated by a 
readily removable membrane that upon removal permits the components to mix. 
Removable membranes maybe glass, plastic, rubber, etc. 

(b) Instructional materials 

Kits may also be supplied with instructional materials. Instructions may be 
printed on paper or other substrate, and/or may be supplied as an electronic-readable 
medium, such as a floppy disc, CD-ROM, DVD-ROM, Zip disc, videotape, audiotape, 
etc. Detailed instructions may not be physically associated with the kit; instead, a user 
may be directed to an internet web site specified by the manufacturer or distributor of the 
kit, or supplied as electronic mail. 

Screening and detection methods 

The isolated nucleic acid molecules of the invention can be used to express an 
AAP (e.g., via a recombinant expression vector in a host cell in gene therapy 
applications), to detect an AAP mRNA (e.g. 9 in a biological sample) or a genetic lesion in 
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an AAP> and to modulate AAP activity, as described below. In addition, AAP 
polypeptides can be used to screen drugs or compounds that modulate the AAP activity or 
expression as well as to treat disorders characterized by insufficient or excessive 
production of an AAP or production of AAP forms that have decreased or aberrant 
activity compared to an AAP wild-type protein, or modulate biological function that 
involve an AAP. In addition, the anti-AAP Abs of the invention can be used to detect and 
isolate an AAP and modulate AAP activity. 
1 . Screening assays 

The invention provides a method (screening assay) for identifying modalities, i.e., 
candidate or test compounds or agents (e.g., peptides, peptidomimetics, small molecules 
or other drugs), foods, combinations thereof; etc., that effect an AAP, a stimulatory or 
inhibitory effect, inlcuding translation, transcription, activity or copies of the gene in 
cells. The invention also includes compounds identified in screening assays. 

Testing for compounds that increase or decrease AAP activity are desirable. A 
compound may modulate an AAP activity by affecting: (1) the number of copies of the 
gene in the cell (amplifiers and deamplifiers); (2) increasing or decreasing transcription of 
an AAP (transcription up-regulators and down-regulators); (3) by increasing or decreasing 
the translation of an AAP mRNA into protein (translation up-regulators and down- 
regulators); or (4) by increasing or decreasing the activity of an AAP itself (agonists and 
antagonists). 

(a) effects of compounds 

To identify compounds that affect an AAP at the DNA, RNA and protein levels, 
cells or organisms are contacted with a candidate compound and the corresponding 
change in an AAP DNA, RNA or protein is assessed (Ausubel et aL, 1987). For DNA 
amplifiers and deamplifiers, the amount of an AAP DNA is measured, for those 
compounds that are transcription up-regulators and down-regulators the amount of an 
AAP mRNA is determined; for translational up- and down-regulators, the amount of an 
AAP polypeptide is measured Compounds that are agonists or antagonists may be 
identified by contacting cells or organisms with the compound, and then examining, for 
example, the model of angiogenesis in vitro. 

In one embodiment, many assays for screening candidate or test compounds that 
bind to or modulate the activity of an AAP or polypeptide or biologically-active portion 
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are available. Ttest compounds can be obtained using any of the numerous approaches in 
combinatorial library methods, including: biological libraries; spatially addressable 
parallel solid phase or solution phase libraries; synthetic library methods requiring 
deconvolution; the "one-bead one-compound" library method; and synthetic library 
methods using affinity chromatography selection The biological library approach is 
limited to peptides, while the other four approaches encompass peptide, non-peptide 
oligomer or small molecule libraries of compounds (Lam, 1997), 

(b) small molecules 

A "small molecule" refers to a composition that has a molecular weight of less 
than about 5 kD and more preferably less than about 4 kD, most preferably less than 0.6 
kD. Small molecules can be, nucleic acids, peptides, polypeptides, peptidomimetics, 
carbohydrates, lipids or other organic or inorganic molecules. Libraries of chemical 
and/or biological mixtures, such as fungal, bacterial, or algal extracts, are known in the art 
and can be screened with any of the assays of the invention Examples of methods for the 
synthesis of molecular libraries can be found in: (Carell et al, 1994a; Carell et al, 1994b; 
Cho et al, 1993; DeWitt et al, 1993; Gallop et al, 1994; Zuckermann et al, 1994). 

Libraries of compounds may be presented in solution (Houghten et al, 1992) or 
on beads (Lam et al, 1991), on chips (Fodor et al, 1993), bacteria, spores (Ladner et al, 
US Patent No. 5,223,409, 1993), plasmids (Cull et al, 1992) or on phage (Cwirla et al, 
1990; Devlin et al, 1990; Felici et al, 1991; Ladner et al, US Patent No. 5,223,409, 
1993; Scott and Smith, 1990). A cell-free assay comprises contacting an AAP or 
biologically-active fragment with a known compound that binds the AAP to form an 
assay mixture, contacting the assay mixture with a test compound, and determining the 
ability of the test compound to interact with the AAP, where determining the ability of the 
test compound to interact with the AAP comprises determining the ability of the AAP to 
preferentially bind to or modulate the activity of an AAP target molecule. 

(c) cell-free assays 

The cell-free assays of the invention may be used with both soluble or a 
membrane-bound forms of an AAP. In the case of cell-free assays comprising the 
membrane-bound form, a solubilizing agent to maintain the AAP in solution Examples 
of such solubilizing agents include non-ionic detergents such as n-octylglucoside, n- 
dodecylglucoside, n-dodecylmaltoside, octanoyl-N-methylglucamide, decanoyl-N- 
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methylglucamide, TRITON® X-100 and others from the TRITON® series, THESIT®, 
IsotridecypoIy(ethylene glycol ether) n , N-dodecyl-N,N-dimethyl-3-ammonio- 1 -propane 
sulfonate, 3-{3-cholamidopropyl) dimethylamminiol-1 -propane sulfonate (CHAPS), or 3- 
(3-cholamidopropyl)dimemylanuruniol-2-hydroxy-l-propane sulfonate (CHAPSO). 
(d) immobilization of target molecules to facilitate screening 
In more than one embodiment of the assay methods, immobilizing either an AAP 
or its partner molecules can facilitate separation of complexed from uncomplexed forms 
of one or both of the proteins, as well as to accommodate high throughput assays. 
Binding of a test compound to an AAP, or interaction of an AAP with a target molecule 
in the presence and absence of a candidate compound, can be accomplished in any vessel 
suitable for containing the reactants, such as microliter plates, test tubes, and 
micro-centrifuge tubes. A fusion protein can be provided that adds a domain that allows 
one or both of the proteins to be bound to a matrix. For example, GST-AAP fusion 
proteins or GST-target fusion proteins can be adsorbed onto glutathione sepharose beads 
(SIGMA Chemical, St. Louis, MO) or glutathione derivatized microliter plates that are 
then combined with the test compound or the test compound and either the non-adsorbed 
target protein or an AAP, and the mixture is incubated under conditions conducive to 
complex formation (e.g., at physiological conditions for salt and pH). Following 
incubation, the beads or microliter plate wells are washed to remove any unbound 
components, the matrix immobilized in the case of beads, complex determined either 
directly or indirectly, for example, as described. Alternatively, the complexes can be 
dissociated from the matrix, and the level of AAP binding or activity determined using 
standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in 
screening assays. Either an AAP or its target molecule can be immobilized using biotin- 
avidin or biotin-streptavidin systems. Biotinylation can be accomplished using many 
reagents, such as biotin-NHS (N-hyoxoxy-succinirnide; PIERCE Chemicals, Rockford, 
IL), and immobilized in wells of streptavidin-coated 96 well plates (PIERCE Chemical). 
Alternatively, Abs reactive with an AAP or target molecules, but which do not interfere 
with binding of the AAP to its target molecule, can be derivatized to the wells of the 
plate, and unbound target or an AAP trapped in the wells by antibody conjugation. 
Methods for detecting such complexes, in addition to those described for the 
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GST-immobilized complexes, include immunodetection of complexes using Abs reactive 
with an AAP or its target, as well as enzyme-linked assays that rely on detecting an 
enzymatic activity associated with the AAP or target molecule, 
(e) screens to identify modulators 

Modulators of AAP expression can be identified in a method where a cell is 
contacted with a candidate compound and the expression of an AAP mRNA or protein in 
the cell is determined. The expression level of the AAP mRNA or protein in the presence 
of the candidate compound is compared to the AAP mRNA or protein levels in the 
absence of the candidate compound. The candidate compound can then be identified as a 
modulator of the AAP mRNA or protein expression based upon this comparison. For 
example, when expression of an AAP mRNA or protein is greater (i.e., statistically 
significant) in the presence of the candidate compound than in its absence, the candidate 
compound is identified as a stimulator of the AAP mRNA or protein expression. 
Alternatively, when expression of the AAP mRNA or protein is less (statistically 
significant) in the presence of the candidate compound than in its absence, the candidate 
compound is identified as an inhibitor of the AAP mRNA or protein expression. The 
level of an AAP mRNA or protein expression in the cells can be determined by methods 
described for detecting an AAP mRNA or protein, 
(i) hybrid assays 

In yet another aspect of the invention, an AAP can be used as "bait" in two-hybrid 
or three hybrid assays (Bartel et al, 1993; Brent et al, WO94/10300, 1994; Iwabuchi et 
al, 1993; Madura et al, 1993; Saifer et al, US Patent No. 5,283,317, 1994; Zervos et al, 
1993) to identify other proteins that bind or interact with the AAP and modulate AAP 
activity. Such AAP-bps are also likely to be involved in the propagation of signals by the 
AAP as, for example, upstream or downstream elements of an AAP pathway. 

The two-hybrid system is based on the modular nature of most transcription 
factors, which consist of separable DNA-binding and activation domains. Briefly, the 
assay utilizes two different DNA constructs. In one construct, the gene that codes for an 
AAP is fused to a gene encoding the DNA binding domain of a known transcription 
factor (e.g., GAL4). The other construct, a DNA sequence from a library of DNA 
sequences that encodes an unidentified protein ("prey" or "sample") is fused to a gene that 
codes for the activation domain of the known transcription factor. If the "bait" and the 
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"prey" proteins are able to interact in vivo, forming an AAP-dependent complex, the 
DNA-binding and activation domains of the transcription factor are brought into close 
proximity. This proximity allows transcription of a reporter gene (e.g. 9 LacZ) that is 
operably-linked to a transcriptional regulatory site responsive to the transcription factor. 
Expression of the reporter gene can be detected, and cell colonies containing the 
functional transcription factor can be isolated and used to obtain the cloned gene that 
encodes the AAP-interacting protein. 

The invention further pertains to novel agents identified by the aforementioned 
screening assays and uses thereof for treatments as described herein. 
2. Detection assays 

Portions or fragments of an AAP cDNA sequences identified herein (and the 
complete AAP gene sequences) are useful in themselves. By way of non-limiting 
example, these sequences can be used to: (1) identify an individual from a minute 
biological sample (tissue typing); and (2) aid in forensic identification of a biological 
sample. 

(a) Tissue typing 

The AAP sequences of the invention can be used to identify individuals from 
minute biological samples. In this technique, an individual's genomic DNA is digested 
with one or more restriction enzymes and probed on a Southern blot to yield unique 
bands. The sequences of the invention are useful as additional DNA markers for 
"restriction fragment length polymorphisms" (RFLP; (Smulson et al, US Patent No. 
5,272,057, 1993)). 

Furthermore, the AAP sequences can be used to determine the actual base-by-base 
DNA sequence of targeted portions of an individual's genome. AAP sequences can be 
used to prepare two PCR primers from the 5'- and 3'-termini of the sequences that can 
then be used to amplify an the corresponding sequences from an individual's genome and 
then sequence the amplified fragment 

Panels of corresponding DNA sequences from individuals can provide unique 
individual identifications, as each individual will have a unique set of such DNA 
sequences due to allelic differences. The sequences of the invention can be used to obtain 
such identification sequences from individuals and from tissue. The AAP sequences of 
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the mvention uniquely represent portions of an individual's genome. Allelic variation 
occurs to some degree in the coding regions of these sequences, and to a greater degree in 
the noncoding regions. The allelic variation between individual humans occurs with a 
frequency of about once ever 500 bases. Much of the allelic variation is due to single 
nucleotide polymorphisms (SNPs), which include RFLPs. 

Each of the sequences described herein can, to some degree, be used as a standard 
agamst which DNA from an individual can be compared for identification purposes 
Because greater ™bers of polymorphisms occur m noncomng regions, fewer sequent 
are necessary to differentiate individuals. Noncoding sequences can positively identify 
individuals with a panel of 10 to 1,000 primers that each yield a noncoding amplified 
sequence of 100 bases. If predicted coding sequences, such as those in SEQ ID NOSrl, 3, 
5, 7, 9, 1 1, 13, and 15 are used, a more appropriate number of primers for positive 
individual identification would be 500-2,000. 

Predictive medicine 

The invention also pertains to the field of predictive medicine in which diagnostic 
assays, prognostic assays, pharmacogenomics, and monitoring clinical trials are used for 
prognostic (predictive) purposes to treat an individual prophylacticaUy. Accordingly one 
aspect of the invention relates to diagnostic assays for determining an AAP and/or nucleic 
add expression as well as AAP activity, in the context of a biological sample (e g. blood 
serum, cells, tissue) to determine whether an individual is afflicted with a disease or 
disorder, or is at risk of developing a disorder, associated with aberrant AAP expression 
or activity, including cancer. The invention also provides for prognostic (or predictive) 
assays for determining whether an individual is at risk of developing a disorder associated 
wzth an AAP, nucleic acid expression or activity. For example, mutations in andean 
be assayed in a biological sample. Such assays can be used for prognostic or predictive 
purpose to prophylactically treat an individual prior to the onset of a disorder 
characterized by or associated with the AAP, nucleic acid expression, or biological 
activity. 

Another aspect of the invention provides methods for detennining AAP activity 
or nucleic acid expression, in an individual to select appropriate therapeutic or 
prophylactic agents for that individual (referred to herein as "pharmacogenomics") 
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Phannacogenomics allows for the selection of modalities (e.g., drugs, foods) for 
therapeutic or prophylactic treatment of an individual based on the individual's genotype 
(e.g, the individual's genotype to determine the individual's ability to respond to a 
particular agent). Another aspect of the invention pertains to monitoring the influence of 
modalities (e.g., drugs, foods) on the expression or activity of an AAP in clinical trials. 
1. Diagnostic assays 

An exemplary method for detecting the presence or absence of an AAP in a 
biological sample involves obtaining a biological sample from a subject and contacting 
the biological sample with a compound or an agent capable of detecting the AAP or the 
AAP nucleic acid (e.g, mRNA, genomic DNA) such that the presence of the AAP is 
confirmed in the sample. An agent for detecting the AAP mRNA or genomic DNA is a 
labeled nucleic acid probe that can hybridize to the AAP mRNA or genomic DNA. The 
nucleic acid probe can be, for example, a full-length AAP nucleic acid, such as the nucleic 
acid of SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13 or 15, or aportion thereof, such as an 
oligonucleotide of at least 15, 30, 50, 100, 250 or 500 nucleotides in length and sufficient 
to specifically hybridize under stringent conditions to an AAP mRNA or genomic DNA. 

An agent for detecting an AAP polypeptide is an antibody capable of binding to 
the AAP, preferably an antibody with a detectable label. Abs can be polyclonal, or more 
preferably, monoclonal. An intact antibody, or a fragment (e.g., F ab or Ffrb'h) can be 
used. A labeled probe or antibody is coupled (/.*., physically linking) to a detectable 
substance, as well as indirect detection of the probe or antibody by reactivity with another 
reagent that is directly labeled. Examples of indirect labeling include detection of a 
primary antibody using a fluorescently-labeled secondary antibody and end-labeling of a 
DNA probe with biotin such that it can be detected with fluorescently-labeled 
streptavidin. The term "biological sample" includes tissues, cells and biological fluids 
isolated from a subject, as well as tissues, cells and fluids present within a subject The 
detection method of the invention can be used to detect an AAP mRNA, protein, or 
genomic DNA in a biological sample in vitro as well as in vivo. For example, in vitro 
techniques for detection of an AAP mRNA include Northern hybridizations and in situ 
hybridizations. In vitro techniques for detection of an AAP polypeptide include enzyme 
linked immunosorbent assays (ELISAs), Western blots, immunoprecipitations, and 
immunofluorescence. In vitro techniques for detection of an AAP genomic DNA include 
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Southern hybridizations and fluorescence in situ hybridization (FISH). Furthermore, in 
vivo techniques for detecting an AAP include introducing into a subject a labeled anti- 
AAP antibody. For example, the antibody can be labeled with a radioactive marker 
whose presence and location in a subject can be detected by standard imaging techniques. 

In one embodiment, the biological sample from the subject contains protein 
molecules, and/or mRNA molecules, and/or genomic DNA molecules. A preferred 
biological sample is blood. 

In another embodiment, the methods further involve obtaining a biological sample 
from a subject to provide a control, contacting the sample with a compound or agent to 
detect an AAP, mRNA, or genomic DNA, and comparing the presence of the AAP, 
mRNA or genomic DNA in the control sample with the presence of the AAP, mRNA or 
genomic DNA in the test sample. 

The invention also encompasses kits for detecting an AAP in a biological sample. 
For example, the kit can comprise: a labeled compound or agent capable of detecting an 
AAP or an AAP mRNA in a sample; reagent and/or equipment for determining the 
amount of an AAP in the sample; and reagent and/or equipment for comparing the 
amount of an AAP in the sample with a standard. The compound or agent can be 
packaged in a suitable container. The kit can further comprise instructions for using the 
kit to detect the AAP or nucleic acid. 

2. Prognostic assays 

The diagnostic methods described herein can furthermore be utilized to identify 
subjects having or at risk of developing a disease or disorder associated with an aberrant 
AAP expression or activity. For example, the assays described herein, can be used to 
identify a subject having or at risk of developing a disorder associated with AAP, nucleic 
acid expression or activity. Alternatively, the prognostic assays can be used to identify a 
subject having or at risk for developing a disease or disorder. Tthe invention provides a 
method for identifying a disease or disorder associated with an aberrant AAP expression 
or activity in which a test sample is obtained from a subject and the AAP or nucleic acid 
(e.g., mRNA, genomic DNA) is detected A test sample is a biological sample obtained 
from a subject For example, a test sample can be a biological fluid (e.g., serum), cell 
sample, or tissue. 
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Pognostic assays can be used to determine whether a subject can be administered 
a modality (e.g., an agonist, antagonist, pepudomimetic, protein, peptide, nucleic acid, 
small molecule, food, etc.) to treat a disease or disorder associated with an aberrant AAP 
expression or activity. Such methods can be used to determine whether a subject can be 
effectively treated with an agent for a disorder. The invention provides methods for 
detenmmng whether a subject can be effectively treated with an agent for a disorder 
associated with an aberrant AAP expression or activity in which a test sample is obtained 
and the AAP or nucleic acid is detected (e.g., where the presence of the AAP or nucleic 
acid is diagnostic for a subject that can be administered the agent to treat a disorder 
associated with the aberrant AAP expression or activity). 

The methods of the invention can also be used to detect genetic lesions in an AAP 
to determine if a subject with the genetic lesion is at risk for a disorder characterized by 
aberrant angiogenesis. Methods include detecting, in a sample from the subject, the 
presence or absence of a genetic lesion characterized by at an alteration affecting the 
integrity of a gene encoding an AAP polypeptide, or the mis-expression of an AAP. Such 
genetic lesions can be detected by ascertaining: (1) a deletion of one or more nucleotides 
from an AAP; (2) an addition of one or more nucleotides to an AAP; (3) a substitution of 
one or more nucleotides inan^4P, (4) a chromosomal rearrangement of an AAP gene; 
(5) an alteration in the level of an AAP mRNA transcripts, (6) aberrant modification of an 
AAP, such as a change genomic DNA methylation, (7) the presence of a non-wild-type 
splicing pattern of an AAP mRNA transcript, (8) a non-wild-type level of an AAP, (9) 
allelic loss of an AAP, and/or (10) inappropriate post-translational modification of an 
AAP polypeptide. There are a large number of known assay techniques that can be used 
to detect lesions in an AAP. Any biological sample containing nucleated cells may be 
used. 

In certain embodiments, lesion detection may use a probe/primer in a polymerase 
chain reaction (PCR) (e.g, (Mullis, US Patent No. 4,683,202, 1987; Mullis et al, US 
Patent No. 4,683,195, 1987), such as anchor PCR or rapid amplification of cDNA ends 
(RACE) PCR, or, alternatively, in a ligation chain reaction (LCR) (e.g., (Landegren et al, 
1988; Nakazawa et al, 1994), the latter is particularly useful for detecting point mutations 
in AAP-genes (Abravaya et al., 1995). This method may include collecting a sample 
from a patient, isolating nucleic acids from the sample, contacting the nucleic acids with 
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one or more primers that specifically hybridize to an AAP under conditions such that 
hybridization and amplification of the^4P (if present) occurs, and detecting the presence 
or absence of an amplification product, or detecting the size of the amplification product 
and comparing the length to a control sample. It is anticipated that PCR and/or LCR may 
be desirable to use as a preliminary amplification step in conjunction with any of the 
techniques used for detecting mutations described herein. 

Alternative amplification methods include: self sustained sequence replication 
(Guatelli et al, 1990), transcriptional amplification system (Kwoh et al, 1989); QP 
Replicase (Lizardi et al, 1988), or any other nucleic acid amplification method, followed 
by the detection of the amplified molecules using techniques well known to those of skill 
in the art These detection schemes are especially useful for the detection of nucleic acid 
molecules present in low abundance. 

Mutations in an AAP from a sample can be identified by alterations in restriction 
enzyme cleavage patterns. For example, sample and control DNA is isolated, amplified 
(optionally), digested with one or more restriction endonucleases, and fragment length 
sizes are determined by gel electrophoresis and compared Differences in fragment 
length sizes between sample and control DNA indicates mutations in the sample DNA. 
Moreover, the use of sequence specific ribozymes can be used to score for the presence of 
specific mutations by development or loss of a ribozyme cleavage site. 

Hybridizing a sample and control nucleic acids, e.g., DNA or RNA, to high- 
density arrays containing hundreds or thousands of oligonucleotides probes, can identify 
genetic mutations in an (Cronin al, 1996;Kozale/a/., 1996). For example, 
genetic mutations in an AAP can be identified in two-dimensional arrays containing 
light-generated DNA probes as described in Cronin, et al, supra Briefly, a first 
hybridization array of probes can be used to scan through long stretches of DNA in a 
sample and control to identify base changes between the sequences by making linear 
arrays of sequential overlapping probes. This step allows the identification of point 
mutations. This is followed by a second hybridization array that allows the 
characterization of specific mutations by using smaller, specialized probe arrays 
complementary to all variants or mutations detected. Each mutation array is composed of 
parallel probe sets, one complementary to the wild-type gene and the other 
complementary to the mutant gene. 
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In yet another embodiment, any of a variety of sequencing reactions known in the 
art can be used to directly sequence an AAP and detect mutations by comparing the 
sequence of the sample AAP-with the corresponding wild-type (control) sequence. 
Examples of sequencing reactions include those based on classic techniques (Maxam and 
Gilbert, 1977; Sanger et al, 1977). Any of a variety of automated sequencing procedures 
can be used when performing diagnostic assays (Naeve et al, 1995) including sequencing 
by mass spectrometry (Cohen et al, 1996; Griffin and Griffin, 1993; Koster, 
WO94/16101, 1994). 

Other methods for detecting mutations in an AAP include those in which 
protection from cleavage agents is used to detect mismatched bases in RNA/RNA or 
RNA/DNA heteroduplexes (Myers et al., 1985). In general, the technique of "mismatch 
cleavage" starts by providing heteroduplexes formed by hybridizing (labeled) RNA or 
DNA containing a wild-type AAP sequence with potentially mutant RNA or DNA 
obtained from a sample. The double-stranded duplexes are treated with an agent that 
cleaves single-stranded regions of the duplex such as those that arise from base pair 
mismatches between the control and sample strands. For instance, RNA/DNA duplexes 
can be treated with RNase and DNA/DNA hybrids' treated with S, nuclease to 
enzymatically digest the mismatched regions. In other embodiments, either DNA/DNA 
or RNA/DNA duplexes can be treated with hydroxylamine or osmium tetroxide and with 
piperidine in order to digest mismatched regions. The digested material is then separated 
by size on denaturing polyacrylamide gels to determine the mutation site (Grompe et al, 
1989; Saleeba and Cotton, 1993). The control DNA or RNA can be labeled for detection. 

Mismatch cleavage reactions may employ one or more proteins that recognize 
mismatched base pairs in double-stranded DNA (DNA mismatch repair) in defined 
systems for detecting and mapping point mutations in an^4PcDNAs obtained from 
samples of cells. For example, the mutY enzyme of E. coli cleaves A at G/A mismatches 
and the thymidine DNA glycosylase from HeLa cells cleaves T at G/T mismatches (Hsu 
et al, 1994). According to an exemplary embodiment, a probe based on a wild-type AAP 
sequence is hybridized to a cDNA or other DNA product from a test cell(s). The duplex 
is treated with a DNA mismatch repair enzyme, and the cleavage products, if any, can be 
detected from electrophoresis protocols or the like (Modrich et al, US Patent No. 
5,459,039, 1995). 
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Electrophoretic mobility alterations can be used to identify mutations in an AAP. 
For example, single strand conformation polymorphism (SSCP) may be used to detect 
differences in electrophoretic mobility between mutant and wild type nucleic acids 
(Cotton, 1993; Hayashi, 1992; Orita * al., ,989). Single-stranded DNA fragments of 
sample and control AAP nucleic acids are denatured and then renatured. The secondary 
structure of single-stranded nucleic acids varies according to sequence; the resulting 
alteration m electrophoretic mobility allows detection of even a single base change The 
DNA fragments may be labeled or detected with labeled probes. The sensitivity of the 
assay may be enhanced by using RNA (rather than DNA), in which the secondary 
structure is more sensitive to a sequence changes. The subject method may use 
heteroduplex analysis to separate double stranded heteroduplex molecules on the basis of 
changes in electrophoretic mobility (Keen et al, 1991). 

The migration of mutant or wild-type fragments can be assayed using denaturing 
gradient gel electrophoresis (DGGE; (Myers etal, 1985). In DGGE, DNA is modified to 
prevent complete denaturation, for example by adding a GC clamp of approximately 40 
bpofhigh-meltingGC-richDNAbyPCR A temperature gradient may also be used in 
Place of a denaturing gradient to identify differences in the mobility of control and sample 
DNA (Rossiter and Caskey, 1990). 

Examples of other techniques for detecting point mutations include, but are not 
limited to, selective oligonucleotide hybridization, selective amplification, or selective 
pnmer extensioa For example, oligonucleotide primer, may be prepared in which the 
known mutation is placed centrally and then hybridized to target DNA under conditions 
that permit hybridization only if a perfect match is found (Saiki et al., 1986; SsM et al 
1989). Such allele-specific oligonucleotides are hybridized to PCR-amplified target DNA 
or a number of different mutations when the oligonucleotides are attached to the 
hybridizing membrane and hybridized with labeled target DNA. 

Alternatively, allele specific amplification technology that depends on selective 
PCR amplication may be used. Oligonucleotide primers for specific amphfications may 
carry the mutation of interest in the center of the molecule (so that amplification depends 
on differential hybridization (Gibbs et al. , 1 989)) or at the extreme 3'-terminus of one 
pnmer where, under appropriate conditions, mismatch can prevent, or reduce polymerase 
extension (Prosser, ,993). Novel restriction site in the region ofthe mutation may be 



WO 01/70808 



PCT/US01/09609 



126- 



introduced to create cleavage-based detection (Gasparini * al, 1992). Certain 
amplification may also be performed using Taq ligase for amplification (Barany, 1991). 
In such cases, ligation occurs only if there is a perfect match at the 3'-terminus of the 5' 
sequence, allowing detection of a known mutation by scoring for amplificatioa 

The described methods may be performed, for example, by using pre-packaged 
kits comprising at least one probe (nucleic acid or antibody) that may be conveniently 
used, for example, in clinical settings to diagnose patients exhibiting symptoms or family 
history of a disease or illness involving an AAP. 

Furthermore, any cell type or tissue in which an AAP is expressed may be utilized 
in the prognostic assays described herein. 
3 . Pharmacogenomics 

Agents, or modulators that have a stimulatory or inhibitory effect on AAP activity 
or expression, as identified by a screening assay can be administered to individuals to 
treat, prophylactically or therapeutically, disorders, including those associated with 
anagenesis. In conjunction with such treatment, the pharmacogenomics (i.e., the study 
of the relationship between a subject's genotype and the subject's response to a foreign 
modality, such as a food, compound or drug) may be considered Metabolic differences 
of therapeutics can lead to severe toxicity or therapeutic failure by altering the relation 
between dose and blood concentration of the pharmacologically active drug Thus, the 
pharmacogenomics of the individual permits the selection of effective agents f**., drugs) 
for prophylactic or therapeutic treatments based on a consideration of the individual's 
genotype. Pharmacogenomics can further be used to determine appropriate dosages and 
therapeutic regimens. Accordingly, the activity of an AAP, expression of an^4P nucleic 
add, or an^/> mu tation( S ) in an individual can be determined to guide the selection of 
appropriate agent(s) for therapeutic or prophylactic treatment. 

Pharmacogenomics deals with clinically significant hereditary variations in the 
response to modalities due to altered modality disposition and abnormal action in affected 
persons (Eichelbaum and Evert, 1996; Linder et al, 1997). In general, two 
pharmacogenetic conditions can be differentiated: (1) genetic conditions transmitted as a 
single factor altering the interaction of a modality with the body (altered drug action) or 
(2) genetic conditions transmitted as single factors altering the way the body acts on a 
modality (altered drug metabolism). These phannacogenetic conditions can occur either 
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as rare defects or a, nucleic acid polymorphisms. F „ r mmpk _ glucose-e.pfc.sphate 
d^enase^D,^^^^^^^^^^^ 

cluneal complication is hemolysis after ingestion of oxidant drugs (artfrmalarial, 
aufonanndes, analgesics, nitrofurans) and consumption of fava beans 

As an illustrative embodiment, the activity of drug metabolizing enzymes is a 
major determinant of bom the intensify and duranon of drug actio. The discovery of 
geaetic polymorphisms of drug metebohang wymes ^ ^ % 
2) and cytochrome P450 enzymes CYP2D6 at», CYP2C19) explains me phenomena of 

the standard and safe doso of a drug. These polymotphisms areexpresscdint.ro 

" " POpUla,i0n - * e — b. — «*» (BM) and poor metabolizer (PM) 
™cpr^a,enceofPMisdifferen«amongdif,eren,pop Ul a«ions. For example, the 
™ " P ° lyn " > ' hic md — ' —» We been identified in PM 
™ Mtoto ^°''tacS„na 1 CYP2 D «. Poor metabouzers due to mutant ' 
^andCH-^tequently experience exaggerated drugresponses and side 

PMsWno therapeutic response, as demonstrated for the analgesic effect of codeine 
med^by iBC ^ f0 ^ me ^ te— ^^^^^^^ 

caUedulha-mpidmehteh^whoareunresponsivetostandarddoses. Recently me 
molecular basisofultra-rapidmetabolism has been identic to be due to CYP2D6 gene 

Jdto apply geno^ingofpo^,^^^^^ 

me .demarcation of an individual's drug responsiveness phenotyp* This k.ow.edge 

when applied to dosing or deselection, can avoid adverse reactions or therapeutic' 

^^^^^^^^^^ 

withan AAP modulator, such as a modutator identified by one of the described 

exemplary screening assays. 

4- Monitoring effects during clinical trials 
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Monitoring the influence of agents (e.g, drugs, compounds) on the expression or 
actmty of an AAP (e.g., the ability to modulate anagenesis) can be applied not only in 
baac drug screening, but also in clinical trials. For example, the effectiveness of an agent 
deternnned by a screening assay to increase an AAP expression, protein levels or up- 
regulate an AAP's activity can be monitored in clinical trails of subjects exhibiting 
decreased AAP expression, protein levels, or down-regulated AAP activity. Alternatively 
the effectiveness of an agent determined to decrease an AAP expression, protein levels or' 
down-regulate an AAP's activity, can be monitored in clinical trails of subjects exhibiting 
increased the AAP expression, protein levels, or up-regulated AAP activity. Ia such 
chmcal tnals, the expression or activity ofthe AAP and, preferably, other genes that have 
been unphcated in, for example, anagenesis can be used as a "read out" or markers for a 
particular cell's responsiveness. 

For example, genes, including an AAP, that are modulated in cells by treatment 
withamodality^.food, compound, drug or small molecule) can be identified. To 
study the effect of agents on angiogenesis, for example, in a clinical trial, cells can be 
isolated and RNA prepared and analyzed for the levels of expression of an AAP and other 
genes nnplicated in the disorder. The gene expression pattern can be quantified by 
Normem blot analysis,™^ 

amount of protein, or by measuring the activity level ofthe AAP or other gene products 
m tins manner, the gene expression pattern itself can serve as a marker, indicative ofthe 
cellular physiological response to the agent Accordingly, this response state may be 
determined before, and at various points during, treatment ofthe individual with the 
agent. 

The invention provides a method for monitoring the effectiveness of treatment of 
a subject wuh an agent (e.g., an agonist, antagonist, protein, peptide, peptidomimetic 
nuclexc acid, small molecule, food or other drug candidate identified by the screening 
assays described herein) comprising the steps of (1) obtaining a P re-administration 
sample from a subject; (2) detecting the level of expression of anAAP mRNA or 
genomic DNA in the preadrninistration sample; (3) obtaining one or more post- 
adrmmstration samples from the subject; (4) detecting the level of expression or activity 
ofthe AAP, mRNA, or genomic DNA in the post-administration samples; (5) comparing 
the level of expression or activity ofthe AAP, mRNA, or genomic DNA in the pre- • 
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admuustration sample with the AAP, mRNA, or genomic DNA in the post administration 
sample or samples; and (6) altering the administration of the agent to the subject 
accordingly. For example, increased administration of the agent may be desirable to 
increase the expression or activity of the AAP to higher levels than detected, / e to 
increase the effectiveness of the agent Alternatively, decreased administration of the 
agent may be desirable to decrease expression or activity of the AAP to lower levels than 
detected, i.e., to decrease the effectiveness of the agent. 

5. Methods of treatment 

The invention provides for both prophylactic and therapeutic methods of treating a 
subject at risk of (or susceptible to) a disorder or having a disorder associated with 
aberrant AAP expression or activity. Furthermore, these same methods of treatment may 
be used to mducc or inhibit angiogenesis, by changing the level of expression or activity 
of an AAP. 

6. Disease and disorders 

Diseases and disorders that are characterized by increased AAP levels or 
biological activity may be treated with therapeutics that antagonize (, e., reduce or inhibit) 
activity. Antognists may be administered in a therapeutic or prophylactic manner 
Therapeutics that may be used include: (1) AAP peptides, or analogs, derivatives 
fragments or homologs thereof; (2) Abs to an AAP peptide; (3) AAP nucleic acids- (4) 
administration of antisense nucleic acid and nucleic acids that are "dysmnctional" (/ e 
due to a heterologous insertion within the coding sequences) that are used to eliminate' 
endogenous function of by homologous recombination (Capecchi, 1989); or (5) 
modulators (,e., inhibitors, agonists and antagonists, including additional peptide mimetic 
of the mvention or Abs specific to an AAP) that alter the interaction between an AAP and 
its binding partner. 

Diseases and disorders that are characterized by decreased AAP levels or 
biological activity may be treated with therapeutics that increase (i.e., are agonists to) 
actmty. Therapeutics that upregulate activity may be administered therapeutically or 
prophylactically. Therapeutics that may be used include peptides, or analogs, derivatives 
fragments or homologs thereof; or an agonist that increases bioavailability. 

Increased or decreased levels can be readily detected by quantifying peptide 
and/or RNA, by obtaining a patient tissue sample (,g., from biopsy tissue) and assaying 
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m vitro for RNA or peptide levels, structure and/or activity of the expressed peptides (or 
AAP mRNAs). Methods include, but are not limited to, immunoassays (e.g., by Western 
blot analysis, immunoprecipitation followed by sodium dodecyl sulfate (SDS) 
polyacrylamide gel electrophoresis, immunocytochemistry, etc.) and/or hybridization 
assays to detect expression ofmKNAs (e.g., Northern assays, dot blots, /„ situ 
hybridization, and the like). 

7. Prophylactic methods 

The invention provides a method for preventing, in a subject, a disease or 
condition associated with an aberrant AAP expression or activity, by administering an 
agent that modulates an AAP expression or at least one AAP activity. Subjects at risk for 
a isease that is caused or contributed to by an aberrant AAP expression or activity can be 
identified by, for example, any or a combination of diagnostic or prognostic assays 
Admnustration of a prophylactic agent can occur prior to the manifestation of symptoms 
characteristic of the AAP aberrancy, such that a disease or disorder is prevented or 
alternately, delayed in its progression. Depending on the type of AAP aberrancy for 

exampl^anAAPagonistorAAPantagonistcanbeusedtotreatthesubject The ' 
appropriate agent can be determined based on screening assays. 

8. Therapeutic methods 

Another aspect of the invention pertains to methods of modulating an AAP 
expression or activity for therapeutic purpose, The modulatory method of the invention 
uwolves contacting a cell with an agent that modulates one or more of the activities of 
AAP activity associated with the celL An agent that modulates AAP activity can be a 
nucleic acid or a protein, a naturally occurring cognate ligand of an AAP, a peptide an 
AAP peptidomimetic, or other small molecule. The agent may stimulate AAP activity. 
Examples of such stimulatory agents include an active AAP and an^Pnucleic acid 
molecule that has been introduced into the cell. In anomer embodiment, me agent inlubits 
AAP activity. Examples of inhibitory agents include antisenseA4/> nucleic acids and 
anti-AAP Ab, Modulatory methods can be performed ,„ vitro (e. g ., by culturing the cell 
with the agent) or, alternatively, /„ vivo {e . g „ by administering ^ ageQt tQ a ^ ^ 
such, the mvention provides methods of treating an individual afflicted with a disease or 
drsorder characterized by aberrant expression or activity of an AAP or nucleic acid 
molecule. In one embodiment, the method involves administering an agent an agent 
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idenhfied by a screening assay), or combination of agents that modulates {eg. up - 
regulatesordown-regul^AAPexpressionoractivity. In another embodiment the 
method involves administering an AAP or nucleic acid molecule as therapy to 
compensate for reduced or aberrant AAP expression or activity. 

Stimulation of AAP activity is desirable in situations in which AAP is abnormally 
down-regulated and/or in which increased AAP activity is likely to have a beneficial 
effect. One example of such a situation is where a subject has a disorder characterized by 
aberrant angiogenesis (e.g, cancer). 

9. Determination of the biological effect of the therapeutic 
Suitable in vitro or in vivo assays can be performed to determine the effect of a 
specie therapeutic and whether its administration is indicated for treatment of the 

afifected tissue. 

to various specific embodiment, /„ viiro assays may be performed with 
represent, ceils of <he ») involved in the panenfs disorder, «o determine if. 
g.ven therapeutic exerts the desired effect up™ tbe cell type<s). Modalities for use in 
ttaapy ma, be tested in suitable animal mode, systems shading, but no, limited to rats 
"nee, clucken, cows, monkeys, rabbits, and the tike, prior ,o testing in human subjects. ' 
Snmlariy, for in viro testing, any of the animal model system known in the art may be 
used pnor to administration to human subjects. 

10. Prophylactic and therapeutic uses of the compositions of the invention 
AAP nuctac acids and proteins are useful in potential prophylactic and 
tap**, apphcations implicated in a variety of disorders including, but no, limited to 
those related to angiogenesis. 

As an example, a cDNA encoding an AAP may be useful in gene therapy, and the 
pratem maybe useful when administered to a subject in need thereof 

AAP nucleic acids, or fragments thereof, may also be useful in diagnostic 
apphcations, wherein the present or amount of to nuc..ic acid or the protein is to be 
assessed. A further use could be as an anti-bacterial molecule (i.e., some peptides have 
been font* to possess Abacterial properties). These materials are further useful in me 

generaUonof Abs ma, inrntunospecmcaDybind^menovdsubs^cesofme invention 
for use m therapeutic or diagnostic methods. 

The following example is meant to not be limiting. 
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Example 



Identification of genes differentially-regulated 

A comprehensive mRNA profiling technique (GeneCaUing) was used to 
detenmne differential gene expression profiles of human endothelial cells undergoing 
dtferenfiafion^ 

data from GeneCalhng, independent experiments were undertaken that used gene-specific 
PCRohgonucleotideprimerpah. and an oligonucleotide probe labeled with a fluorescent 
dye at the 5' end and quencher fluorescent dye at the 3' end.. Total RNA (50 ng) was 
added to a 50 fil RT-PCR mixture and run. 
The following data were collected: 

J^G collagengel24hrversus4h 4.5 fold upregulated 

KLP collagen gel 24 hr versus 4h 3.5 fold updated 

myosinX ^genge 24 hr versus 4h 3.5 fold upreg^aS 

S collagen gel 24 hr versus 4h 3.5 fold upreguLted 

^ZF SE22i rmi "J 7.3 fold do'w^Sed 

collagen gel 24 hr versus 4h 2.1 fold upregulated 



Equivalents 

Although partioular embodiments have been disclosed herein in detail this has 
been done by way of example for purposes of illustration only, and is not interred to be 
^gwimrespecttomesc^eoftheappend^ 

contemplated by me inventors .bat various substations, alterations, and mooificanons ' 

T« e rf totatoVenta ™* Md ^ tatte * iri, - d ^»f^favenSo n 
asdefinedbythectaims. Tne choice of nucleic acid storting materia!, clone of interest or 

wm knowledge of the embodiment described hereto. Omer aspect, advantages and 
modifications considered within the scope of the following claims 
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CLAIMS 

1 . An isolated polypeptide comprising an amino acid sequence having at least 
80% sequence identity to the sequence of SEQ ID NOS:2. 4, 6, 8, 10, 12, 14 or 16. 

2. The polypeptide of claim 1, wherein said polypeptide is an active AAP 
polypeptide. 

3. The polypeptide of claim 2, wherein said amino acid sequence has at least 
90% sequence identity to the sequence of SEQ ID NOS:2. 4, 6, 8, 10, 12, 14 or 16. 

4. The polypeptide of claim 2, wherein said amino acid sequence has at least 
98% sequence identity to the sequence of SEQ ID NOS:2. 4, 6, 8, 10, 12, 14 or 16. 

5. An isolated polynucleotide encoding the polypeptide of any one of claims 
1-4, or a complement of said polynucleotide. 

6. An isolated polynucleotide comprising a nucleotide sequence having at 
least 80% sequence identity to the sequence of SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13 or 15, or 
a complement of said polynucleotide. 

7. The polynucleotide of claim 6, wherein said nucleotide sequence has at 
least 90% sequence identity to the sequence of NOS:l, 3, 5, 7, 9, 1 1, 13 or 15 , or a 
complement of said polynucleotide. 

8. The polynucleotide of claim 6, wherein said nucleotide sequence has at 
least 98% sequence identity to the sequence of SEQ ID NOS:l, 3, 5, 7, 9, 11, 13 or 15, or 
a complement of said polynucleotide. 



1-4. 



9. 



antibody that specifically binds to the polypeptide of any one of claims 
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10. 



A method of modulating angiogenesis comprising modulating the activity 
of at least one AAP. 



11. The method of claim 10 wherein said modulating angiogenesis is 
increasing angiogenesis, and said modulating the activity comprises increasing the 
activity of at least one polypeptide selected from the group consisting of KLP, hBAZF, 
hTRG, hMXl, hMX2, hEF-G, and hMP. 

12. The method of claim 10 wherein said modulating angiogenesis is 
decreasing angiogenesis, and said modulating the activity comprises increasing the 
activity of at least one polypeptide, wherein said at least one polypeptide comprises NHR. 



; is 



13. The method of claim 10 wherein said modulating angiogenesis i 
decreasing angiogenesis, and said modulating the activity comprises decreasing the 
activity of at least one polypeptide selected from the group consisting of KLP, hBAZF, 
hTRG, hMXl, hMX2, hEF-G, and hMP. 

14. The method of claim 1 0 wherein said modulating angiogenesis is 
increasing angiogenesis, and said modulating the activity.comprises decreasing the 
activity of at least one polypeptide, wherein said at least one polypeptide comprises NHR. 

15. The method of claim 1 1 or 12 wherein said increasing activity comprises 
increasing the expression of said at least one polypeptide. 



16. The method of claim 13 or 14 wherein said decreasing activity comprises 
decreasing the expression of said at least one polypeptide. 

17. The method of claim 15 wherein said increasing expression comprises 
transforming a cell to increase expression of a polynucleotide encoding said at least one 
polypeptide. 
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18. The method of claim 16 wherein said decreasing expression comprises 
traiisfonning a cell to express a polynucleotide anti-sense to at least a portion of an 
endogenous polynucleotide encoding said at least one polypeptide. 

19. The method of claim 13 or 14 wherein said decreasing activity comprises 
traiisforming a cell to express an aptamer to said at least one polypeptide. 

20. The method of claim 13 or 14 wherein said decreasing activity comprises 
introducing into a cell an aptamer to said at least one polypeptide. 

21. The method claim 13or 14 wherein said decreasing activity comprises 
administering to a cell an antibody that selectively binds to said at least one polypeptide. 

22. A method of treating tumors comprising decreasing angiogenesis by the 
method of claim 12 or 13. 



23. A method of treating cancer comprising treating a cancerous tumor by the 
method of claim 22. 



24. Amethod of treating myocardial infarction comprising increasing 
angiogenesis by the method of claim 1 1 or 14. 



25. A method of promoting healing comprising increasing angiogenesis by the 
method of claim 11 or 14. 



26. A method for determining whether a compound up-regulates or down- 
regulates the transcription of an AAP gene, comprising: 

contacting said compound with a composition comprising a RNA polymerase and 
said gene and measuring the amount of said AAP gene transcription. 

27. The method of claim 26, wherein said composition is in a cell. 
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28. A method for determining whether a compound up-regulates or down- 
regulates the translation of an AAP gene, comprising: 

contacting said compound with a composition comprising a ribosome and a 
polynucleotide corresponding to a mRNA of said gene and measuring the amount of said 
AAP gene translation. 

29. The method of claim 28, wherein said composition is in a cell. 

30. A vector, comprising the polynucleotide of any one of claims 5-8. 

31. A cell, comprising the vector of claim 30. 

32. A method of screening a tissue sample for tumorigenic potential, 
comprising: 

measuring expression of at least one AAP gene in said tissue sample. 

33. The method of claim 32, wherein said measuring is measuring an amount 
of a polypeptide encoded by said at least one AAP gene. 

34. The method of claim 32, wherein said measuring expression is measuring 
an amount of mRNA corresponding to said at least one AAP gene. 



35. A transgenic non-human animal, having at least one disrupted AAP 



gene. 



36. The transgenic non-human animal of claim 35, wherein the non-human 
animal is a mouse. 



37. A transgenic non-human animal, comprising an exogenous polynucleotide 
having at least 80% sequence identity to the sequence of SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13 
or 15 , or a complement of said polynucleotide. 
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38. The transgenic non-human animal of claim 37, wherein said exogenous 
polynucleotide has at least 90% sequence identity to the sequence ofSEQ ED NOS:l, 3, 5, 
7, 9, 1 1, 13 or 15 , or a complement of said polynucleotide. 

39. The transgenic non-human animal of claim 37, wherein said exogenous 
polynucleotide has at least 98% sequence identity to the sequence ofSEQ LD NOS.l, 3, 5, 
7, 9, 11, 13 or 15 , or a complement of said polynucleotide. 

40. A method of screening a sample for an AAP gene mutation, comprising: 
comparing an AAP nucleotide sequence in the sample with SEQ ID NOS-1 3 5 

7, 9, 11, 13 or 15. 

41. A method of determining the clinical stage of tumor comprising comparing 
expression of at least one AAP gene in a sample with expression of said at least one gene 
in control samples. 



